SPECIAL SECTION 
Speciation 
INTRODUCTION 

727 Happy Birthday, Mr. Darwin 


REVIEWS 


728 The Red Queen and the Court Jester: 
Species Diversity and the Role of Biotic 
and Abiotic Factors Through Time 
M. J. Benton 


732 Adaptive Radiation: Contrasting Theory 
with Data 
S. Gavrilets and J. B. Losos 


737 Evidence for Ecological Speciation and 
Its Alternative 
D. Schluter 


741 The Bacterial Species Challenge: Making 
Sense of Genetic and Ecological Diversity 
C. Fraser et al. 


746 _ |s Genetic Evolution Predictable? 
D. L. Stern and V. Orgogozo 


>> For a list of all related content, see p. 727 or go to 
www.sciencemag.org/darwin/ 


EDITORIAL 


687 Species Uncertainties 
Robert M. May and Paul H. Harvey 
>> Speciation section p. 727 


NEWS OF THE WEEK 


696 Agencies Sweat the Details of Spending 
Billions More on Science 


697 New Ph.D.s to Teach Harvard Undergrads 


698 California Researchers Chilled by Sudden 
Freeze on Bond Funds 
Stem Cell Institute Looks for New Ways to Raise Cash 


699 Life Scientists Cautious About Dual-Use 
Research, Study Finds 


COVER 


700 Some Neglected Diseases Are More 
Neglected Than Others 


701 Invisibility Umbrella Would Let Future 
Harry Potters See the Light 


701 From the Science Policy Blog 


NEWS FOCUS 


702 Polio: Looking for a Little Luck 
Rethinking the Polio Endgame 


706 Evolutionary Biology: Agreeing to Disagree 


>> Speciation section p. 727 


709 ORIGINS 
On the Origin of Art and Symbolism 


>> Speciation section p. 727; Science Podcast 


LETTERS 


713 Keeping Raw Data in Context 
I. Sim et al. 
The IRB Is Key 
W. R. Lovallo 
Response 
J. A. List 
Implications of Ancient Ice 
S. Zimov 
Response 
D. G. Froese et al. 


714 CORRECTIONS AND CLARIFICATIONS 
714 TECHNICAL COMMENT ABSTRACTS 


BOOKS ET AL. 


716 Why Evolution Is True 
J. A. Coyne, reviewed by M. Pigliucci 


717 Perspectives in Animal Phylogeny 
and Evolution 
A. Minelli, reviewed by W. Arthur 


POLICY FORUM 
718 Biologically Reversible Exploration 
C. PR. McKay 


Charles Darwin's abiding interest in the classification of 
barnacles, illustrated here, was one of the inspirations for his 
investigations into the origins and diversification of species. 
Science marks Darwin's 200th birthday with a special section on 
speciation beginning on page 727. 
Image: Illustrations by George Sowerby, reproduced with permission 
from The Complete Work of Charles Darwin Online, John van Wyhe, Ed. 
(http://darwin-online. org.uk); background: iStockphoto.com 


page 709 


PERSPECTIVES 


719 


720 


721 


723 


724 


725 


Forming Massive Stars 
B. Whitney 
>> Report p. 754 


Time's Stamp on Modern Biogeography 
j. A. Crame 
>> Report p. 767 


Computational Social Science 
D. Lazer et al. 


Moonlighting in Mitochondria 
M. G. Myers Jr. 
>> Report p. 793 


Heavy Metals or Punk Rocks? 
R. J. Bodnar 
>> Report p. 764 


Confined Polymers Crystallize 
Pj. Lemstra 
>> Report p. 757 


CONTENTS continued >> 


DEPARTMENTS 


683 This Week in Science 

688 Editors’ Choice 

690 = Science Staff 

693 Random Samples 

695 Newsmakers 

807 _ Information for Contributors 
809 New Products 

810 Science Careers 


www.sciencemag.org SCIENCE VOL323 6 FEBRUARY 2009 


Published by AAAS 


677 


BREVIA 


753 The Sea-Level Fingerprint of West Antarctic 
Collapse 
J. X. Mitrovica et al. 
An uneven pattern of local sea-level rise 
would result from the collapse of the West 
Antarctic Ice Sheet. 


REPORTS 
754 The Formation of Massive Star Systems 
by Accretion 


M. R. Krumholz et al. 

Numerical simulations show that instabilities 
can channel gas into growing massive stars 
despite their high radiation pressure. 

>> Perspective p. 719 


757 Confined Crystallization of Polyethylene 
Oxide in Nanolayer Assemblies 
H. Wang et al. 
Squeezing ultrathin polymer layers can 
induce crystallization and decrease their 
gas permeability. 
>> Perspective p. 725 


760  Nitrogen-Doped Carbon Nanotube 
Arrays with High Electrocatalytic Activity 
for Oxygen Reduction 
K. Gong et al. 

Carbon atoms in nanotubes are activated 
by nitrogen doping and catalyze oxygen 
reduction at high pH. 


764  Anomalously Metal-Rich Fluids Form 
Hydrothermal Ore Deposits 
J.J. Wilkinson et al. 
The metal contents of the fluid inclusions 
that create lead-zinc ores have been 
drastically underestimated. 
>> Perspective p. 724 


767 — Signature of the End-Cretaceous Mass 
Extinction in the Modern Biota 
A. Z. Krug et al. 
Increased rates of marine bivalve speciation 
after the Cretaceous-Tertiary extinction were 
most marked in the tropics. 
>> Perspective p. 720 


771 A Great-Appendage Arthropod with a 
Radial Mouth from the Lower Devonian 
Hunsriick Slate, Germany 
G. Kihl et al. 

Fossil evidence for the great-appendage 
arthropods extends to more recent periods 
by about 100 million years. 


773 Two Thresholds, Three Male Forms Result 
in Facultative Male Trimorphism in Beetles 
J. M. Rowland and D. J. Emlen 
A clade of dung beetles can alter their 
reproductive tactics by switching between 
three male forms. 


776 


779 


782 


785 


789 


793 


797 


800 


802 


Sequential Sympatric Speciation Across 
Trophic Levels 
A. A. Forbes et al. 


A host shift by a fly is causing its wasp parasite 
to undergo changes leading to speciation. 


Evolution of the Drosophila Nuclear Pore 
Complex Results in Multiple Hybrid 
Incompatibilities 

S. Tang and D. C. Presgraves 

Nuclear pore complex proteins are involved 
in lethal gene interactions in Drosophila 
melanogaster-D. simulans species hybrids. 


Queen Ants Make Distinctive Sounds That 
Are Mimicked by a Butterfly Social Parasite 
F. Barbero et al. 

Sounds made by queen ants denoting rank are 
copied by the larvae and pupae of a parasitic 
butterfly, facilitating their infiltration into 
ant colonies. 


Stability Predicts Genetic Diversity 

in the Brazilian Atlantic Forest Hotspot 
A. C. Carnaval et al. 

Modeling ecological niches under past 
climates establishes a method to improve 
prediction and protection of tropical 
biodiversity hotspots. 


Orci Controls Centriole and Centrosome 
Copy Number in Human Cells 

A. S. Hemerly et al. 

A DNA replication factor ensures that only two 
centrioles are formed during cell division in 
animal cells. 


Function of Mitochondrial Stat3 in Cellular 
Respiration 

J. Wegrzyn et al. 

A protein that has a nuclear function as a 
transcription factor also functions to support 
respiration in the mitochondria. 

>> Perspective p. 723 


Platelets Kill Intraerythrocytic Malarial 
Parasites and Mediate Survival to Infection 
B. J. McMorran et al. 


Mouse and human platelets bind malarial- 
infected red cells and kill the parasite within. 


Changes in Cortical Dopamine D1 
Receptor Binding Associated with 
Cognitive Training 

F. McNab et al. 

Training of working memory alters brain 
biochemistry by changing the density of 
cortical dopamine receptors. 

>> Science Podcast 


Axon Regeneration Requires a Conserved 
MAP Kinase Pathway 

M. Hammarlund et al. 

Axon regeneration in adult nematode worms 
has distinct signals that are independent of 

earlier developmental programs. 


CONTENTS continued >> 


www.sciencemag.org SCIENCE VOL323 6 FEBRUARY 2009 


Published by AAAS 


page 721 


page 737 


page 771 


CONTENTS 


679 


CREDITS: (SCIENCE SIGNALING) CHRIS BICKEL; (SCIENCE CAREERS) SHAI MEIRI 


SCIENCEONLINE 


CONTENTS 


SCIENCEXPRESS 


www. sciencexp ress.org 


Molecular and Evolutionary History of Melanism 
in North American Gray Wolves 

T. M. Anderson et al. 

Black coat color in wolves derives from a mutation 
that originated in domestic dogs. 
10.1126/science.1165448 

>> Science Podcast 


Blue or Red? Exploring the Effect of Color 
on Cognitive Task Performances 

R. Mehta and R. Zhu 

Blue favors creativity in humans, whereas red 


improves attention to detail. 
10.1126/science.1169144 


Dynamic Order-Disorder in Atomistic Models 

of Structural Glass Formers 

L. O. Hedges et al. 

The liquid-to-glass transition of a simple mixture is 
controlled by parameters that drive the system away 
from equilibrium. 

10.1126/science.1166665 


Implications of a VLBI Distance to the 

Double Pulsar J0737-3039A/B 

A. T. Deller et al. 

Tests of gravity theories that use this double pulsar, 
now recognized as being more distant, will be more 


precise. 
10.1126/science.1167969 


TECHNICALCOMMENTS 


Comment on “Human-Specific Gain of Function 
in a Developmental Enhancer” 
L. Duret and N. Galtier 


full text at www.sciencemag.org/cgi/content/full/323/ 
5915/714c 


Response to Comment on “Human-Specific Gain 
of Function in a Developmental Enhancer” 
S. Prabhakar et al. 


full text at www.sciencemag.org/cgi/content/full/323/ 
5915/714d 


SCIENCENOW 
www.sciencenow.org 
Highlights From Our Daily News Coverage 


Test Tube Babies Shed Light on 

Nature Versus Nurture 

Children born to genetically unrelated mothers 
provide clues to origins of antisocial behavior. 
Turtles Island-Hopped Their Way Across a 
Warm Arctic 

Widespread volcanism and tropical temperatures 
helped ancient turtles migrate to America. 
Titan’s Methane Mystery 


Scientists suspect liquid hydrocarbon reservoirs lurk 
beneath the Moon’s surface. 


SCIENCESIGNALING 
www.sciencesignaling.org 
The Signal Transduction Knowledge Environment 


PERSPECTIVE: Extracellular ATP in the Immune 
System—More Than Just a “Danger Signal” 

A. Trautmann 

Extracellular ATP is an important modulator 

of immune responses. 


PERSPECTIVE: Bistable Switches for Synaptic 
Plasticity 

H. Ogasawara and M. Kawato 

Nearly a decade intervened between theoretical 
predictions and experimental verification of the 
mechanisms underlying synaptic plasticity. 


PODCAST 

P.P. Pandolfi and A. M. VanHook 

Inactivating mTOR signaling in the adult mouse 
prostate decreases the incidence of cancer. 


E-LETTER: Microarrays Need Phylogenetics 

M. Abu-Asab 

Interpreting microarray experiments might benefit 
from evolution-compatible analytical tools. 


SCIENCECAREERS 
www.sciencecareers.org/career_magazine 
Free Career Resources for Scientists 


February 2009 Funding News 
J. Fernandez 


Learn about the latest in research funding, 
scholarships, fellowships, and internships. 


Darwin's Legacy 1: Rich Collections, 
Deep Expertise 
S. Carpenter 


Natural history museums have reasserted their 
scientific relevance in recent years. 


Darwin's Legacy 2: Keeping Order 
K. Travis 
London entomology curator Erica McAlister is 


responsible for everything from museum collections 
to fieldwork. 


Darwin's Legacy: Resources 

Science Careers Staff 

Here is a collection of resources related to careers 

in the H.M.S. Beagle’s wake. 

>> Speciation section p. 727; www.sciencemag.org/darwin/ 


SCIENCEPODCAST 
www.sciencemag.org/multimedia/podcast 
Free Weekly Show 

Download the 6 February Science Podcast to hear 
about the biochemistry behind better short-term 
memory, the genetics of wolf coat color, the origin 
of art, and more. 


ORIGINSBLOG 

blogs.sciencemag.org/origins 

A History of Beginnings 

>> Speciation section p. 727; www.sciencemag.org/darwin/ 


SCIENCEINSIDER 
blogs.sciencemag.org/scienceinsider 
Science Policy News and Analysis 


SCIENCESIGNALING 
Dynamic synapses. 


SCIENCECAREERS 
Career opportunities via Darwin. 


SCIENCE (ISSN 0036-8075) is published weekly on Friday, except the last week 
in December, by the American Association for the Advancement of Science, 1200 
New York Avenue, NW, Washington, DC 20005. Periodicals Mail postage (publica- 
tion No. 484460) paid at Washington, DC, and additional mailing offices. Copyright © 
2009 by the American Association for the Advancement of Science. The title SCIENCE is 
a registered trademark of the AAAS. Domestic individual membership and subscription 
(51 issues): $146 ($74 allocated to subscription). Domestic institutional subscription 
(51 issues): $835; Foreign postage extra: Mexico, Caribbean (surface mail) $55; other 
countries (air assist delivery) $85. First class, airmail, student, and emeritus rates on 
request. Canadian rates with GST available upon request, GST #1254 88122. Publica- 
tions Mail Agreement Number 1069624. Printed in the U.S.A. 


Change of address: Allow 4 weeks, giving old and new addresses and 8-digit account 
number. Postmaster: Send change of address to AAAS, P.O. Box 96178, Washington, DC 
20090-6178. Single-copy sales: $10.00 current issue, $15.00 back issue prepaid 
‘includes surface postage; bulk rates on request. Authorization to photocopy material 
for internal or personal use under circumstances not falling within the fair use provisions 
of the Copyright Act is granted by AAAS to libraries and other users registered with the 
Copyright Clearance Center (CCC) Transactional Reporting Service, provided that $20.00 
per article is paid directly to CCC, 222 Rosewood Drive, Danvers, MA 01923. The identifi- 
cation code for Science is 0036-8075. Science is indexed in the Reader's Guide to Period- 
ical Literature and in several specialized indexes. 


MN AAAS 


ADVANCING SCIENCE, SERVING SOCIETY 


www.sciencemag.org SCIENCE VOL323 6 FEBRUARY 2009 


Published by AAAS 


681 


CREDITS (TOP TO BOTTOM): J. M. ROWLAND AND D. J. EMLEN; KRUMHOLZ ET AL. 


Protective Film Production? 


For crystallization controlled by nucleation events, 
impurities and boundaries can influence the free 
energy of a system, and thus drive the crystalliza- 
tion toward specific morphologies or crystalline 
forms. The crystallization of polymers occurs 
through the repeated folding of chains into lamel- 
lae and so boundaries also confine chain motion. 
Wang et al. (p. 757; see the Perspective by 
Lemstra) made a series of polymer films through 
a multiple folding process generating extremely 
thin, alternating layers of two different polymers, 
only one of which will crystallize. As the layer 
thickness was decreased, a preferred orientation 
for the crystals developed. Initially, this showed 
fold surfaces parallel to the interface layers, but, 
for very narrow layers, the lamellae aligned paral- 
lel to the interfaces. The crystals in these layers 
looked like single crystals, and presented a much 
tougher barrier for gas diffusion. This approach 
could potentially generate excellent protective 
films from common polymeric materials. 


Legacy of Mass Extinctions 


Mass extinctions restructure taxonomic composi- 
tion of biota on a global scale. However, their 
legacy, in terms of subsequent patterns of evolu- 
tionary diversification, is poorly understood. Krug 
et al. (p. 767; see the Perspective by Crame) 
investigated how the end-Cretaceous (KT) mass 
extinction altered temporal spatial diversifica- 
tion dynamics. Marine bivalves provide excellent 
fossil records and a well-documented modern 
biogeography. Long-term shifts in the rate of 
species origination were identified for the cohort 
of bivalve genera residing in modern oceans, 
which show a clear and sustained increase in 
diversification following the KT event. For 
bivalves, virtually all of the world’s bioprovinces 


EDITED BY STELLA HURTLEY 


<< Beetle Drive 


Males of some animal species can adopt alternative repro- 
ductive strategies. For example, species may have two 
types of male, one of large size and horns that guard 
females, and one of smaller size that lacks horns and 
sneaks copulation. Such facilitative alternative pheno- 
types are thought to be regulated by a single develop- 
» mental threshold. However, Rowland and Emlem 
(p. 773) show that dung beetles may have several 
qualitatively different threshold mechanisms that 
each regulate the expression of the same trait and 
affect the reproductive tactic used. Furthermore, in a sur- 
prising number of beetles, two thresholds influenced a single 
phenotype—the horn—suggesting that facultative male trimorphism occurs 
among a wide range of beetle species. 


q 


today bear the signature of the KT mass extinc- 
tion despite the climatic, tectonic, and biotic 
events of the intervening 65 million years. 


No Size Too Large 


Radiation pressure is thought to halt the forma- 
tion of stars with masses greater than 20 times 
that of the Sun, yet, somehow, such stars are 
known to exist. The radiation pressure produced 
by the light emitted by these stars is 
stronger than their gravitational 
attraction, preventing them 
from accreting gas from their 
natal cloud. Krumholz et al. 

(p. 754, published online 15 
January; see the Perspective 

by Whitney) present three- 
dimensional radiation-hydrody- 
namic simulations of the formation 
of massive stars, which show that the 

instabilities driven by the interaction between 
gravity and radiation allow accretion to continue 
up to unlimited masses. These instabilities also 
produce fragmentation, leading to a high propor- 


tion of multiple star systems among massive stars. 


Oxygen Reduction at 
Doped Carbon Nanotubes 


Fuel cells catalyze the reduction of oxygen with 
platinum, but cost and scarcity of platinum have 
led to a search for other catalysts with similar 
characteristics. Carbon-based catalysts are one 
alternative, and Gong et al. (p. 760) report that 
carbon nanotubes, when doped with nitrogen 
atoms and grown as vertically aligned arrays, 
catalyze the oxygen reduction reaction in highly 
alkaline solutions with activities comparable to 


that of platinum. The high activity is attributed to 
the electron-accepting ability of the nitrogen 
centers creating a net positive charge on adjacent 
carbon atoms. 


Awesome Ores 


Ore deposits represent unusually high concentra- 
tions of metals or other minerals, often deposited 
from a fluid. A major question toward under- 
standing their origins is the concentrations in 
the fluids prior to deposition. Until 
recently, it has not been possible to 
measure metal concentrations in the 
tiny fluid inclusions directly within 
most ore minerals. Wilkinson et al. 
(p. 764; see the Perspective by Bodnar) 
have now measured this concentration 
directly in two lead-zinc deposits. Metal 
concentrations were orders of magnitude 
higher than predicted from solubility arguments 
or from data on accessory minerals. 


Great-Appendage 
Arthropods 


The “great appendage” arthropods, which lie near 
the base of the arthropod phylogenetic tree, are 
characterized by a large clawed limb at the front 
of the head and have only been found in the 
Burgess Shale—-type deposits of the Cambrian. 
Kiihl et al. (p. 771) report the discovery of a 
much more recent great appendage arthropod 
from 100 million years after the Cambrian, pre- 
served in the Hunsriick Slate, the famous pyritized 
fossil deposit near Bundenbach in Germany. This 
arthropod is a kind of chimera, which retains 
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primitive characters in an otherwise more advanced arthropod morphology. Thus, great appendage 
arthropods persisted for much longer than previously assumed, and their absence in the fossil record 
probably reflects a lack of suitable preservation sites. 


Counting Centrosomes 


Accurate chromosome segregation during eukaryotic cell division requires a microtubule spindle, forma- 
tion of which in animals is driven by centrosomes. In order to maintain genome stability, centrosomes 
must be duplicated once every cell division cycle along with the genomic DNA. Hemerly et al. (p. 789) 
show that Orc1, a component of the Origin Recognition Complex (ORC) essential for DNA replication, 
also acts at centrosomes to control their copy number. Orc1 inhibits Cyclin E-dependent reduplication of 
the centrosome, and its localization to centrosomes is directed by the Orc1-binding partner Cyclin A. 


Social Climbing 


Many ant colonies are infiltrated by specialized social para- 
sites that penetrate unharmed to feed on the rich resources 
concentrated within ant nests. Barbero et al. (p. 782) show 
that the larvae and pupae of a large blue (Maculinea) but- 
terfly social parasite mimic the distinctive sounds made by 
their host ant’s queen, eliciting similar enhanced care from 
the workers. The queen ants make different sounds from the 
workers in ant societies, and the workers behave more pro- 
tectively when they hear a queen’s sounds. Thus, the para- 
site is afforded better care and higher social status. 


Little Killers 


Platelets plug damaged blood vessels when activated by exposure to collagen, and can also adhere to 
and promote the sequestration of malaria parasites within narrow peripheral blood vessels, contribut- 
ing to cerebral pathology. Their generally sticky nature also means that platelets are valuable 
adjuncts to front-line defenses against early stages of pathogen attack. McMorran et al. (p. 797) now 
show that, despite their pathological role later in infection, platelets are important to the innate host 
responses against the initial stages of malaria. Platelet inhibitors, including aspirin, remove the lethal 
effects of platelets on the malaria parasite Plasmodium falciparum in vitro. Mice engineered for 
platelet deficiency succumbed more rapidly to infection with a similar parasite Plasmodium chabaudi. 
In both cases the platelets stuck to the infected red blood cells and killed the parasite. 


Working Memory and Dopamine 


Working memory—the ability to retain information for short periods of time—is important for a wide 
range of cognitive functions. Dopaminergic neurotransmission plays a central role in working mem- 
ory. McNab et al. (p. 800) investigated whether cognitive training changes the density of cortical 
dopamine D1 receptors and subcortical dopamine D2 receptors. Intensive training in volunteers 
induced an increase in working memory capacity, which correlated with changes in D1 but not in D2 
receptor binding potential. These training-induced changes indicate an unexpectedly high level of 
plasticity of the human cortical dopamine D1 system and emphasize the mutual interdependence of 
behavior and the underlying brain biochemistry. 


Axonal Regeneration in Nematodes 


During nervous system development, a huge number of axons reach out toward their connections. In the 
adult, however, such outgrowth is much less common. When an adult nerve is injured, neurons are 
sometimes able to grow out new axons, but in humans, this imperfect process occurs more readily for 
nerves of the peripheral nervous system. Hammarlund et al. (p. 802, published online 22 January) now 
shed some light on the molecular tools that support axon regeneration in the nematode, Caenorhabditis 
elegans, and differentiate the process from normal development. In worms in which a genetic fault 
leads to excessively brittle axons, or in which a laser is used to cut axons, regeneration of the axon from 
the remaining stump depended upon a key kinase pathway. This kinase functions early in the road to 
regeneration, but does not seem to be involved during initial development of the axons. 
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| FDITORIAL 


Species Uncertainties 


APPROPRIATELY FOR A YEAR CELEBRATING DARWIN, THIS ISSUE OF SCIENCE HAS A SPECIAL 
section that surveys our current understanding of speciation processes (p. 727). The section 
illustrates Dobzhansky’s memorable injunction that nothing in biology makes sense except 
in the light of evolution. 

You cannot, however, even begin to make sense of biodiversity until you have some system- 
atic sense of what is actually there. This task began, astonishingly, a full century after Newton 
and his predecessors, with the 10th edition of Linnaeus’ De Rerum Naturae (1758). The legacy 
of this unhappy quirk lingers with us today. Take the U.S. Library of Congress—its Web site 
indicated that, as of December 2008, the library held 20,854,816 cataloged books. Yet we do 
not know, to within 10% or so, how many distinct eukaryotic species have been named and 
recorded (the species concept arguably needs to be interpreted differently for prokaryotes). 
This derives partly from the lack of synoptic and coordinated catalogs 
for many invertebrate taxa and partly from resulting unresolved syn- 
onymies: the same species separately identified and named on two or 
more occasions. The total number of known eukaryotic species is cur- 
rently estimated as ~1.8 to 1.9 million, but with all the synonymies 
removed it may be 1.6 million or fewer. Bird and mammal species are 
known very well and most other vertebrates reasonably well. Some- 
where around 90% of plant species are probably known. But our knowl- 
edge of invertebrate species (insects, helminths, and others) is woefully 
inadequate. So credible estimates of the true eukaryotic species total run 
around 5 to 10 million, but suggestions as low as 3 million or as highas | 
100 million can be defended. 

Given these uncertainties, we can say even less about the num- 
bers of species likely to be threatened with extinction. Assessments 
by the International Union for Conservation of Nature (IUCN) sug- 
gest that ~20% of all mammal species and 12% of all birds are seriously 
endangered. But our ignorance about even those insect species that have been recorded is 
highlighted by the fact that only 0.06% of them feature in the IUCN Red List of Threat- 
ened Species. It is nevertheless possible to estimate the rate of species extinctions over the 
past century as compared with average rates over the 550-million-year span of the fossil 
record by assuming that well-known groups, such as birds and mammals, are representa- 
tive. This suggests that recent extinction rates have been 100 to 1000 times faster than 
average. And under further stresses from habitat destruction, alien introductions, and 
overexploitation, all exacerbated by climate change, a further 10-fold increase is foreseen 
over the coming century. This puts us on the breaking tip of a sixth great wave of mass 
extinctions, differing from the previous Big Five in that it is associated with our activities 
rather than with environmental events. 

A better understanding of speciation processes, and of species themselves and their ecolog- 
ical roles, is not just an academic exercise. The recent Millennium Ecosystem Assessment made 
a comprehensive appraisal of the condition of, and trends in, the world’s ecosystem services: 
benefits provided to humans as a result of species’ interactions within ecosystems. It identified 
24 categories of such services, finding that 15 are being degraded or used unsustainably, 4 have 
been enhanced, and the remaining 5 are unevaluatable. A separate study assessed the value of 
these services as being comparable with conventionally assessed gross domestic product. Most 
of these ecosystem services are delivered by invertebrate species, which are largely unknown to 
us. One consequence is that our growing understanding of how ecosystems are structured and 
function remains insufficient to answer the question of whether we will be able to engineer 
repairs to, or substitutes for, degraded ecosystem services. 

Other initiatives, long overdue, are speeding up the rate at which we are cataloging the library 
of living species. But concomitant growth in humanity’s ecological footprint, arguably to unsus- 
tainable levels, requires all these advances and more. Best of times, worst of times. 

— Robert M. May and Paul H. Harvey 


10.1126/science.1170937 
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U.S. BUDGET 


Bond crisis halts 
California projects 


Agencies Sweat the Details of 
Spending Billions More on Science 


It is a problem U.S. science agencies are 
delighted to have: how to spend billions 
of dollars on basic research over the next 
18 months in a way that does not cause 
headaches down the road. 

Although tax breaks and aid to belea- 
guered states are getting most of the media 
attention, President Barack Obama’s plan to 
stimulate the U.S. economy includes an 
unprecedented infusion of money into sci- 
ence. It reflects bipartisan support for the idea 
that research not only puts people to work but 
also increases the chances of creating new 
industries that will sustain economic growth. 
Most of the money would be spent on either 
bread-and-butter research grants 
to individuals and small groups or 
larger awards to rejuvenate labs, 
buy equipment, and build or reno- 
vate facilities at universities and 
federal laboratories. 

Agency officials say there are 
more than enough worthy ideas 
for putting the money to immedi- 
ate use. But the trick will be to 
avoid a crash when the stimulus 
money dries up. For infrastruc- 
ture projects, that means not let- 
ting capacity outstrip the govern- 
ment’s ability to support the 
researchers who will occupy the 
space. The challenge is even 
greater for grants, requiring a del- 
icate balancing of veteran and 
first-time grantees, consideration 
of the impact on underrepresented groups, 
and avoiding a bolus of new grants that expire 
at the same time and trigger a flood of appli- 
cation renewals. Any mistakes in managing 
either pot, say scientists from both the previ- 
ous and current Administrations, could desta- 
bilize the overall scientific enterprise. 

“T do think that money of the magnitude 
being proposed can be spent on useful 
things,” says John Marburger, who headed 
the White House Office of Science and Tech- 
nology Policy for the entire tenure of Presi- 
dent George W. Bush. “But it’s short-term 


money. The great danger is creating facilities 
that no one can afford to operate.” 

Harold Varmus, who advised the Obama 
campaign and who was named co-chair of 
the President’s Council of Advisers on Sci- 
ence and Technology in December, makes 
the same point about research grants. “Not 
everybody understands that grants create an 
obligation,” says Varmus, a former U.S. 
National Institutes of Health (NIH) director. 
“So the base is crucial. Obama talked repeat- 
edly during the campaign about gradual and 
consistent funding for science. Maybe part of 
this [stimulus] should go into the base.” 

Although the package is still being 


A leading light. The stimulus bill could speed up construction of Brookhaven’s 
$900 million National Synchrotron Light Source II. 


debated, the eventual figures for science 
promise to be huge. The $819 billion package 
approved last week by the House of Represen- 
tatives contains some $20 billion for research- 
related activities, including a $6 billion dollop 
for the Department of Education to distribute 
to states for all manner of repairs and renova- 
tions at colleges and universities. The plan 
now before the Senate offers a slightly differ- 
ent mix—more for research and less for infra- 
structure at NIH, less for both pots at the U.S. 
National Science Foundation (NSF), and less 
for Department of Energy (DOE) facilities. 


Security concerns 


change U.S. biology 


Obama, who has promised that three-fourths 
of the new, one-time spending will occur by 
the fall of 2010, has urged legislators to work 
out their differences in the next few weeks so 
that the money can start flowing. 

NSF, NIH, and DOE officials are taking 
different tacks in preparing for the windfall. 
NSF plans to do more of the same within both 
research and infrastructure. NIH hopes to add 
new twists to its research portfolio. And DOE 
is expected to accelerate construction, letting 
the future take care of itself. Here are high- 
lights from each agency. 


| NSF. The House bill would give NSF 
$2 billion for new research grants and 
$900 million for three different types of 
infrastructure programs, including reviving 
and expanding one for university projects. 
The Senate is offering less in both cate- 
gories—$1 billion and $350 million. 

Rather than asking for new proposals, 
Director Arden L. Bement says 
that program managers will dip 
into the pool of existing applica- 
tions submitted since 1 October 
2008, the start of the 2009 fiscal 
year, and fund those rated most 
highly by reviewers. “We have 
enough in our backlog to spend 
that [$2 billion] right now,” he 
says. The expansion will also 
allow NSF “to take care of young 
investigators,” he adds. 

Money for a multiyear grant 
(typically 3 years) would be 
committed up front, at the 
time of the award, rather than 
parceled out year by year. That 
will put pressure on investiga- 
tors “to spend it quickly rather 
than putting it into the bank,” 
says Bement. NSF estimates that the House 
package translates into 3000 additional 
awards that would employ 12,750 scien- 
tists, postdocs, and graduate and under- 
graduate students. Then there’s the impact 
on the economy of what’s bought with the 
grant, everything from reagents to high- 
end instrumentation. 

NSF also plans to stay the course on 
infrastructure spending. Bement says he will 
dust off and update the last solicitation for 
an identical but smaller academic modern- 
ization program that lapsed in 1997 if the 
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$200 million proposed by the 
House makes it through the 
legislative gauntlet. 


NIH. Acting NIH Director 
Raynard Kington says the 
agency has drawn up three 
ways to spend the grants’ por- 
tion of the windfall, which is 
$1.5 billion in the House ver- 
sion and $2.7 billion in the 
Senate. (Senator Arlen Specter 
(R-PA) has proposed bumping 
the Senate number to $9.2 bil- 
lion.)The first would be a new call for propos- 
als involving “topics in which there have been 
scientific or technical challenges” that might 
yield quick results with a blast of cash. 
Researchers would apply for accelerated 
review for awards of up to $500,000 a year for 
2 years. These challenge grants would fit well 
with NIH’s emphasis on new investigators and 
high-risk science, Kington says. 

NIH would spend some of the stimulus 
money on standard investigator grants (ROIs) 
that scored well in peer review last year but did- 
n’t get funded. The catch: The money would 
last only 2 years instead of the usual 4 years. 
Finally, Kington says, NIH might also add to 
the size of the awards made to investigators 
whose requested budgets were cut or who can 
identify “related research areas that might be 
meritorious.” Again, however, the supplement 
would probably be for only a couple of years. 

All of these strategies are aimed at avoid- 
ing what Kingston calls the “hard lesson” of 
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Growth industry? Congress may revive and expand 
an NSF program that funded research facilities like 
these greenhouses at the University of California, Davis. 


the 5-year doubling that left grantees with 
what Marburger describes as “a gargantuan 
appetite” that NIH hasn’t been able to fill. 
“There will be great pressure [to use the stim- 
ulus money] to fund more RO1s just as we 
currently fund them,” Kington says. “But we 
have to be careful that we do not do anything 
imprudent, and that includes setting our- 
selves up to have big commitments 2 years 
out” that can’t be met. 

The House bill would also give NIH $1.5 bil- 
lion to spend on repairs and improvements of 
extramural research facilities and for “shared 
instrumentation.” (The Senate would provide 
$300 million.) NIH is all set to handle awards 
like these and can do it quickly, Kington says. 
Both houses have proposed $500 million for 
renovations to the NIH campus. 


DOE. The big gap between what the House 
and Senate have proposed for DOE’s Office 
of Science—$2 billion versus $430 million— 


New Ph.D.s to Teach Harvard Undergrads 


Harvard University plans to hire up to 20 recent 
Ph.D.s to teach undergraduate courses in a 
move that officials say will improve instruction 
and help students facing a tough job market. 

The new College Fellows Program was 
announced last week in an e-mail to faculty and 
will go into effect this fall. Fellows will be paid 
$48,000 with full benefits to work in some 20 
academic departments throughout the Faculty 
of Arts and Sciences. The program is open to all 
recent—since 2005—Ph.D. graduates. The 
awards are for | year, with a second year possi- 
ble, and the money will come from the univer- 
sity’s instructional budget. 


“A large part of the goal was to support 
graduates and mentor excellent teaching 
among recent Ph.D.s, and [another] was to 
meet essential teaching needs in Harvard Col- 
lege,” says Allan Brandt, dean of Harvard’s 
Graduate School of Arts and Sciences. “We 
wanted to develop strong teaching for our Har- 
vard College students and to make sure our 
teaching needs were met.” 

Each fellow will be assigned a faculty 
mentor, and teaching-focused seminars are 
planned. The program, Brandt says, “is 
designed for people who have a deep interest 
in university teaching.” Fellows will be 
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is forcing DOE officials to 
hold their fire. Acting office 
head Patricia Dehmer would 
say only that her staffhas done 
“detailed homework” on how 
to implement the House’s 
orders to “support improve- 
ments to DOE laboratories 
and scientific facilities.” 

Her former boss, Raymond 
Orbach, says he assumes 
legislators “will split the 
difference.” The American 
Physical Society has drawn 
up a list of $1.6 billion in infrastructure or 
equipment improvements that could be 
jump-started within 4 months, and Orbach 
says he’s not afraid of accruing long-term 
mortgages. “The actual running cost of a 
facility is only 10% of its construction,” 
says Orbach. “These investments will create 
platforms that don’t exist anywhere else in 
the world. That’s an opportunity you can’t 
afford to pass up.” 

The wait is agonizing for individual proj- 
ect directors. Steven Dierker, who manages 
the just-approved National Synchrotron Light 
Source II at Brookhaven National Laboratory 
in Upton, New York, says adding $150 million 
to DOE’s request for $103 million this year 
would put hundreds more people to work, 
sooner. It would also save money, he adds, by 
shortening the 7-year construction schedule 
for the $912 million facility. 

—JEFFREY MERVIS 
With reporting by Eli Kintisch and Eliot Marshall. 


expected to carry 70% of the teaching load of 
a faculty member, leaving them some time to 
pursue their research. “At this career stage,” 
says Brandt, “it’s very important that they 
have some protected time to continue their 
research endeavors.” 

James Hanken, Alexander Agassiz profes- 
sor of zoology, says the program offers new 
Ph.D. recipients “a sort of a temporary hold” in 
a tough job market. “Tf it’s a teaching postdoc 
that doesn’t consume all of your waking hours 
and leaves you time to do some research, I think 
it can be a good deal,” he says. -SUSAN GAIDOS 
Susan Gaidos writes for Science Careers. 
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FINANCIAL CRISIS 


California Researchers Chilled by 
Sudden Freeze on Bond Funds 


For many California researchers, the bad 
news came just before the holidays: As a 
result of the state’s deteriorating financial 
situation, the department of finance was 
freezing billions of dollars in funding tied 
to the sale of state bonds. These funds had 
been slated for a vast variety of projects 
statewide, everything from supporting 
schools and public housing to building 
libraries and fixing freeways. But the freeze 
also pulled the plug—at least temporarily— 
on thousands of environmental research and 
conservation projects. In the ensuing weeks, 
scientists, graduate students, and nonprofit 
organizations have faced a mad scramble to 
replace what most had assumed was a secure 
source of funding. 

When the decision was announced on 
19 December, Rikk Kvitek, a marine ecolo- 
gist at California State University, Monterey 
Bay, got the call right before heading to his lab 
holiday party. “I had to walk in and say ‘Merry 
Christmas, and by the way, there’s no work 
tomorrow because virtually all of our funding 
is tied up in this.’ ” Kvitek is a principal inves- 
tigator on a $20 million state-funded sea- 
floor—mapping project that was just hitting 
full stride. “For the last 8 years, I’ve been 
really pushing the state to map all the state 
waters because we knew that it would really 
transform coastal marine research and how 
resource management was done,” he says. The 
high-resolution digital maps are being used, 
among other things, to help select sites for the 
state’s network of marine protected areas. 
Because of its scale, the project has become 
the primary focus of Kvitek’s work, and he 
says for the first time in 20 years he had all of 
his funding eggs in one basket. 

Like others, Kvitek was instructed to 
stop all work immediately on bond-funded 


projects. The next day, a 180-foot ship sur- 
veying the northern California coast pulled 
into port and has been out of commission 
ever since. Kvitek has managed to find tem- 
porary funding from a foundation to keep 
his 15 students and staff members working 
on data analysis, but he says that money will 
last only 2 or 3 months. 

California’s governor and legislature are at 
loggerheads about how to narrow a projected 
$40 billion budget deficit, and the state’s 
credit rating has tanked, scuttling its ability to 
sell bonds to raise capital. In more typical 
times, the state loans money to approved proj- 
ects and subsequently recoups the money by 
selling bonds. But now that the bonds aren’t 
selling, the state decided that the loans had to 
stop. Environmental projects are feeling the 
pinch in part because of the public support 


On hold. A sea floor—mapping study (left) and 
coastal prairie restoration project are among the 
victims of California’s bond fund freeze. 


they’ve received in recent years as California 
voters have passed propositions authorizing 
the state to sell bonds to fund projects to study 
and manage the state’s natural resources. (The 
state’s bond-funded stem cell institute has not 
yet felt the pain as acutely but is girding itself; 
see sidebar.) 

“This came totally out of the blue,” says 
Susan Williams, the director of the Bodega 
Marine Laboratory in Bodega Bay, adminis- 
tered by the University of California (UC), 
Davis. Many of the frozen projects were 
intended to help support California’s coastal 
economy, Williams says. “These are not pie- 
in-the-sky projects,” she says, citing ocean- 
monitoring projects used by fisheries man- 
agers and the U.S. Coast Guard, efforts to 
combat invasive species, and research on the 
impact of climate change. 

Some projects have been put on hold at a 
critical juncture. One example, Williams 
says, is a coastal prairie-restoration project 
run by the Bodega marine lab. This postcard- 
pretty habitat along the Sonoma County 
coast is being taken over by invasive grasses. 
The eradication plan called for a series of 
mowing and herbicide treatments followed 
by plantings of native species. The first treat- 
ment was completed last fall, but the funds 
are now frozen for the second round of grass 
removal, which should be happening now. 
“Unless the funding is restored and we can 
do the second removal, the seed bank of these 
weeds will germinate,” causing “an even 
bigger problem” than existed before the > 


Stem Cell Institute Looks for New Ways to Raise Cash 


In 2004, California voters authorized the state to raise $3 billion for stem cell research through 
the sale of bonds. Now, the state’s finances are in disarray and its bonds aren't selling, forcing the 
California Institute for Regenerative Medicine (CIRM) to look for new ways to raise cash. At a meet- 
ing last week in Burlingame, California, the institute’s leaders discussed alternatives and provi- 
sionally approved funding for $58 million in new training grants. 

Although the state’s recent decision to freeze funding on projects tied to bond sales has had a 
chilling impact on many researchers in the state (see main text), the freeze has not had a big 
immediate impact on CIRM, says Vice President for Operations John Robson. As of 1 January, the 
institute had $158 million on hand, thanks to proceeds from bond sales that took place before the 
recent economic nosedive. That should be enough to maintain operations and honor previous 
funding commitments for several months, Robson says: “We feel like we have enough money to 
carry us to about November.” 

But then what? CIRM estimates it would need to raise close to $136 million just to continue 
funding ongoing commitments through the end of 2010; it would need a total of $377 million to 
also fund projects that have already been given preliminary approval from its board and antici- 
pated new programs. At the meeting, CIRM Chairman Robert Klein proposed raising money by 
selling private placement bonds to wealthy investors interested in the potential societal benefits 
of stem cell research. -G.M. 
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program started, Williams says. 

The restoration project was also paying 
tuition, fees, and a stipend for Tawny Mata, an 
ecology graduate student at UC Davis. The 
funding freeze “has pretty much left me up in 
the air about how I’ll finish my Ph.D.,” Mata 
says. A teaching assistantship is paying her 
bills this quarter, but beyond that Mata isn’t 
sure how she’ll manage. “I’ve been contacting 
any professor I’ve ever worked with to see if 
they have any money lying around.” She’s not 
alone: A recent e-mail survey found that at 
least 24 out of about 160 students in her pro- 
gram had lost at least some funding. 

Across UC Davis, 60 projects received 
stop-work orders, says Jan Hopmans, chair of 
the university’s Department of Land, Air and 
Water Resources—20 in his department 
alone. “Many of these grants are for a few 


BIOSECURITY 


hundred thousand to a few million dollars,” 
Hopmans says. “We have 50 employees just in 
my department for whom we have in principle 
no funding at this time.” Researchers, stu- 
dents, and technicians have been reassigned to 
projects with other sources of funding where 
possible, Hopmans says, but so far 13 people 
have received layoff notices. 

Nonprofit groups are also feeling the pain. 
“A lot of our grantees are relatively small 
organizations, and some of them will go out 
of business if this goes on too long,” says 
Samuel Schuchat, executive officer of the 
Coastal Conservancy, the state agency 
charged with administering bond-funded 
grants for coastal research and conservation. 
One such program, the Invasive Spartina Pro- 
ject, an effort to eradicate invasive Spartina 
cordgrass from the San Francisco Bay, would 
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be especially painful to lose, say Schuchat 
and others. The state has already invested 
nearly $10 million in the project, which has 
reduced the area covered by the grass by 90% 
since 2006 and is on course to eradicate it by 
2012, says Peggy Olofson, the project’s direc- 
tor. Olofson has cobbled together money to 
run a scaled-down operation this year, but 
beyond that the future is uncertain. 

How long the bond funds will remain 
frozen is unclear, but all eyes are on Sacra- 
mento, where the governor and state legisla- 
tors are wrangling over how to close the 
budget gap—a necessary first step toward 
restoring the state’s credit rating and restor- 
ing its ability to sell bonds. Only then will 
those affected by the freeze be able to start 
thinking of a thaw. 

—-GREG MILLER 


Life Scientists Cautious About Dual-Use Research, Study Finds 


Some life scientists are changing the way they 
do business because of security concerns, 
according to a U.S. survey released this week. 

Researchers and policymakers in the 
United States have been hotly debating the 
need for new government regulations to pre- 
vent the misuse of life sciences research by 
terrorists and other bad actors. Even without 
such regulations, according to the survey, a 
few scientists are avoiding “dual use” research 
projects with the potential for harm; some are 
shying away from international collabora- 
tions; others are excluding foreign graduate 
students and postdocs from certain lines of 
work and censoring themselves while talking 
about their research. 

In all, 15% of the nearly 2000 life scientists 
who responded to the survey, conducted in late 
2007 by the National Research Council and 
AAAS (publisher of Science), reported having 
changed their behavior in one or more of those 
ways. “It is a surprisingly high number,” says 
study chair Ronald Atlas, a microbiologist at 
the University of Louisville in Kentucky. He 
finds it worrisome that security concerns may 
be impinging on the traditional openness of 
research in the life sciences. “What’s not clear 
is whether the community is overreacting or if 
this is an appropriate response,” Atlas says. 

The finding is also an implicit endorsement 
of the popular argument among academics for 
letting scientists police themselves on dual-use 
research rather than imposing government- 
mandated rules. The National Science Advi- 
sory Board for Biosecurity endorsed that self- 
governance approach in recommendations to 


Self-review. Some researchers are avoiding certain projects 
because of security concerns. 


the government in 2007, but federal officials 
have not yet decided what the policy should be. 
Richard Ebright, a chemist at Rutgers Uni- 
versity, New Brunswick, who has argued in 
favor of tougher regulations, says he finds the 
survey results “hard to believe,” given that 
previous studies have shown that most scien- 
tists in the community aren’t even aware of 
dual-use concerns. Ebright suspects that the 
survey, which was e-mailed to 10,000 life sci- 
entists who are members of AAAS, attracted 
an overwhelming proportion of responses 
from individuals who would “prefer not to see 
[government] regulations.” Atlas agrees that 
the survey may have captured “a biased group 
that had been thinking about this topic” and 
says that the findings “would require further 
verification from broader surveys.” 


The study authors say the sur- 
vey results point to the need for 
clearer guidelines on what kinds 
of research might have the poten- 
tial for dual use. “It’s possible that 
some life scientists are being over- 
cautious because there is no good 
definition of dual-use research,” 
Atlas says. Panelist Robert Cook- 
Deegan, a biosecurity expert at 
Duke University in Durham, 
North Carolina, says biosafety 
committees at some institutions 
are already working with their sci- 
entists to help evaluate the dual- 
use potential of research projects 
and respond accordingly. 

As an example, he cites a 
project led by Mark Denison of Vanderbilt 
University in Nashville, Tennessee, and 
Ralph Baric of the University of North Car- 
olina, Chapel Hill, that set out to make a 
SARS-like virus using synthetic biology 
techniques. The researchers “thought about 
dual use with their biosafety committees all 
along, and we did a half-day workshop 
before their publication to talk about what 
should not be included in the final publica- 
tion and why,” Cook-Deegan says. The paper 
was published in the 16 December 2008 
issue of the Proceedings of the National 
Academy of Sciences, with minor modifica- 
tions to the language and no data withheld. 
“Tt’s a really nice example of scientists tak- 
ing dual use seriously,” he says. 

—-YUDHIJIT BHATTACHARJEE 
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GLOBAL HEALTH 


Some Neglected Diseases Are More Neglected Than Others 


Ina bird’s-eye view of who’s fighting diseases 
among the poor, Ireland and the United States 
stand out for largess; Germany and Japan 
seem skimpy. As for targets, the battle against 
dengue is relatively well-funded, whereas 
pneumonia, meningitis, and diarrhea get 
mere table crumbs. The “big three”— 
HIV/AIDS, malaria, and tuberculosis— 
together soak up some 80% of the money. 
Those are some conclusions from the first 
global study tracking spending on research 
and development for “neglected diseases.” 
The authors, led by Mary Moran of the 
George Institute for International Health in 
Sydney, Australia, hope that the study will 
help grantmakers decide where best to put 
their money and spur countries into action. 
With the help of an expert panel, the team 
first decided which diseases currently suf- 
fered scientific neglect; to qualify, they had 
to occur disproportionately in low- or middle- 
income countries, and there had to be a need 
for new vaccines, drugs, or diagnostics that 
the market fails to address. The team then 
painstakingly tried to trace every grant or 


investment made in 2007 by governments, 
institutes, charities, and companies to 
address those needs. The paper, published in 
PloS Medicine this week, documents a total 
of $2.5 billion spent on 30 diseases. 
Government agencies pick up the biggest 
part of the tab, with the U.S. National Insti- 
tutes of Health good for 42% of the research. 
(The Bill and Melinda Gates Foundation 
comes in second at 18%.) Per capita, how- 
ever, Ireland is the biggest public spender. 
A comparison of the estimated global 
burden of each disease as expressed in dis- 
ability-adjusted life years (DALYs), a 
measure that aims to capture both disease 
and death, shows that R&D funders have 
flocked to the big three, while other dis- 
eases with an even bigger burden get little 
R&D attention. The accuracy of the DALY 
estimates, which the World Health Organi- 
zation last updated in 2004, is under con- 
stant debate. Moran explains that funders 
may have other reasons to pick a disease, 
such as the expectation that investments 
will pay off quickly; that’s why the team 
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mentioned DALYs only briefly in its report. 
Still, it’s clear that some diseases are under- 
funded, she says. 

Indeed, the data show that the big three no 
longer deserve the term “neglected,” says 
David Molyneux of the Liverpool School of 
Tropical Medicine in the United Kingdom, 
who calls the paper “impressive.” Molyneux 
says policymakers must “shift resources” to 
respiratory infections, diarrhea, and other 
underfunded diseases. Some can be con- 
trolled without new research, by scaling up 
current, and often cheap, methods. (See 
Three Q’s on p. 695.) 

The team, which has funding from the Bill 
and Melinda Gates Foundation to do the sur- 
vey annually through 2013, is already work- 
ing on next year’s edition. Next time, Moran 
hopes to include the contributions of four sig- 
nificant funders—India, China, and pharma 
giants Merck and Wyeth—that failed to sub- 
mit data this year. But she worries that the 
economic downturn may lead some funders 
to spend less where more is needed. 

—MARTIN ENSERINK 
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PHYSICS 


Invisibility Umbrella Would Let 
Future Harry Potters See the Light 


Two years ago, physicists blurred the distinc- 
tion between science and fiction by produc- 
ing a shell-like “invisibility cloak” that made 
both itself and an object inside it unde- 
tectable—albeit when viewed with micro- 
waves of a specific frequency. Now, a team 
from Hong Kong has gone 
one better with a theoreti- 
cal scheme for an “invisi- 
bility umbrella” that can 
make both itself and an 
object placed beside it 
disappear. The previous 
cloak ferried incoming 
light around the enclosed 
object, rendering it blind to 
the outside world. In con- 
trast, the umbrella leaves 
the hidden thing out in 
the open, so it can “see” 
its surroundings. 

It’s still a far cry from 
the magic garment that 
enables Harry Potter to 
sneak around Hogwarts 
Castle unseen and spy on 
others, but the concept 
draws rave reviews from 
other researchers. “It’s an 
absolutely brilliant idea,” 
says physicist Ulf Leon- 
hardt of the University of 
St. Andrews in the U.K., 
one of the pioneers of 
cloaking theory. 

In devising the scheme, Yun Lai, Che Ting 
Chan, and colleagues at the Hong Kong Uni- 
versity of Science and Technology in 
Kowloon melded two earlier approaches. In 
2005, Andrea Alt and Nader Engheta, electri- 
cal engineers at the University of Pennsylva- 
nia, predicted that researchers could make an 
object nearly invisible by coating it with a tai- 
lored layer of “metamaterial”—an assem- 
blage of metallic rods and C-shaped rings that 
interacts with electromagnetic radiation in 
novel ways—counteracting the thing’s ten- 
dency to scatter light. 

In contrast, in May 2006, Leonhardt and, 
independently, John Pendry of Imperial Col- 
lege London imagined stretching space so 
that the paths of light rays bend around an 
object. They then calculated how they might 
mimic that impossible stretching by sculpt- 


See? An object scatters light waves 
traveling from left to right (top). With 
its embedded antiobject, the invisibility 
umbrella counteracts the scattering, 
rendering both things invisible. 


ing the properties of a shell of metamaterial. 
Building on Pendry’s implementation of 
such “transformation optics,” experimenters 
David Schurig and David Smith at Duke 
University in Durham, North Carolina, 
produced a microwave cloak just 5 months 
later (Science, 20 Octo- 
ber 2006, p. 403). 

Chan and colleagues 
struck a middle ground. 
In theory and simula- 
tions, they first used 
transformation optics to 
perfectly cancel the scat- 
tering from a cylindrical 
post of ordinary material 
by coating it in meta- 
material. The researchers 
then realized that they 
could make an object 
near the now-invisible 
post disappear, too. 

The trick, they report in 
a paper to be published in 
Physical Review Letters, is 
to embed a matching “anti- 
object”—the metamaterial 
equivalent of a voodoo 
doll—in the outer layer of 
the post. The scattering 
from the embedded anti- 
object exactly cancels the 
scattering from the object, 
Chan says, “so it looks as 
if there is nothing there.” 
Because the hidden object remains outside 
the post or umbrella, it can detect light from 
its surroundings. 

The scheme does have limitations, Pendry 
notes. The umbrella works for only a single 
frequency, has to be specifically tailored to the 
object to be hidden, and won’t completely 
hide something that absorbs light. Still, 
Pendry says, “that’s carping on my part—it’s 
really a neat idea.” 

Making an invisibility umbrella even for 
microwaves may be challenging. It requires 
“left-handed” metamaterials that bend light 
in an especially strange way and are difficult 
to make. Still, Leonhardt says, “it’s clear that 
someone will do this in the future.” Given 
current progress, don’t be surprised if some 
wizard of an experimenter does it sooner 
rather than later. -ADRIAN CHO 


NEWS OF THE WEEK be 


Sciencelnsider 


From the Science 
Policy Blog 


As the global economy 
continues to melt down, scientists saw some 
bright spots. But the news wasn’t all good, 
and some of it was downright odd. Here are 
some highlights from Science’s science 
policy blog, Sciencelnsider: 


U.S. scientists saw more dollar signs 
this week when the U.S. Senate debated 
billions for federal research as part of 
a massive economic stimulus package. 
Sciencelnsider has a breakdown of who 
would get what. Still, some argue that 
money alone won't be sufficient to help 
young students who want to pursue scien- 
tific careers; an overhaul of the U.S. 
research system is needed. Things weren't 
as rosy north of the border: On 27 Janu- 
ary, the Canadian prime minister 
unveiled a budget that makes some big 
cuts to the country’s main source of 
research grants. Given the prospects for 
flush funding in the United States, Cana- 
dian scientists worry about a brain drain. 
Scientists in France are even more riled 
up, thanks to an incendiary speech by 
President Nicolas Sarkozy, which lam- 
basted the country’s research system as 
“infantilizing and paralyzing.” 

Also making headlines: two chemical 
elements. Officials at Los Alamos National 
Laboratory are scratching their heads over 
a mysterious case of beryllium contami- 
nation. Adding to the intrigue is the 
source of the harmful chemical: a part of 
the lab known as Technical Area 41. Mean- 
while, off dry land, scientists have started 
dumping 6 tons of iron into the Southern 
Ocean as part of a controversial geo- 
engineering project that aims to increase 
the ocean’s ability to absorb CO,. 

And finally, in a bit of bizarre news, an 
unnamed senior official at the U.S. National 
Science Foundation apparently viewed 
pornography on a work computer over 
2 years. The incident has caught the atten- 
tion of Senator Charles Grassley (R-IA), who 
now questions whether the agency deserves 
the $3 billion the U.S. House of Representa- 
tives has proposed giving it as part of the 
economic stimulus package. 


For the full postings and more, go to 
blogs.sciencemag.org/scienceinsider. 
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Polio: Looking for a Little Luck 


After another year of setbacks—and some real gains—the global polio 
eradication initiative continues with more support than ever 


GENEVA, SWITZERLAND—Soon after she 
took office 2 years ago, Margaret Chan called 
an urgent meeting. The time had come, said 
the new director-general of the World Health 
Organization (WHO), to take a hard look at 
the global polio-eradication initiative, which 
by then was 6 years past deadline, a couple of 
billion dollars over budget, and facing 
increasing questions about its feasibility from 
scientists and tapped-out donors. She wanted 
no more grand promises of when the virus 
would be vanquished from the planet. Instead, 
Chan and the “major stakeholders”—the part- 
ner organizations, donors, and countries— 
launched an “intensified” 2-year 
program, setting measurable mile- 
stones by which to judge progress. 
The leaders of the global initia- 
tive, a collaborative effort based at 
WHO, were to report back in Feb- 
ruary 2009, at which time the 
world could reassess its massive 
investment in the biggest global 
health program ever. 

That moment of reckoning is 
here, and the initiative has met 
only one of the milestones set 
2 years ago. At 1643, global polio 
cases in 2008 were actually higher 
than the 1315 total in 2007, and the 
virus remains entrenched in the 
last four countries where, for reasons both 
social and biological, it refuses to budge. 

Still, no one is talking about pulling the 
plug. If anything, the beleaguered program has 
garnered more support and more money than it 
did several years ago. Just last month, the Bill 
and Melinda Gates Foundation, Rotary Inter- 
national, and the U.K. and German govern- 
ments pledged $635 million for polio eradica- 
tion. “Polio has to succeed” is the widely 
voiced sentiment among Chan and other 
global health leaders, not only because of the 
huge investment—20-plus years and nearly 
$6 billion—but also because of the unsettling 
realization that there is no palatable way out 
(Science, 20 April 2007, p. 362). Stopping now, 
so close to the finish, would mean losing the 
spectacular gains of the past 20 years—a defeat 
that would certainly be the death knell for other 
potential global eradication projects, like those 


push it 
over the line 
if | have to.” 


—BRUCE AYLWARD 


for malaria or measles, says Peter Wright, an 
infectious-disease expert at Dartmouth College 
who advises the eradication initiative. And that 
is a decision no one is yet ready to make. 

That leaves the Global Polio Eradication 
Initiative (GPEI)—a collaboration led by 
WHO, Rotary International, UNICEF, and the 
US. Centers for Disease Control and Preven- 
tion—trying every trick in the book to beat the 
virus into submission. GPEI has a new 5-year 
plan that calls for reaching those unmet mile- 
stones—and far more. The program is invest- 
ing heavily in research on improved vaccines 
that earlier program leaders swore would 


“We won't 
let up. | will 
personally 
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never be necessary; the virus would be gone 
by the time one could be developed. In a major 
departure, they are rethinking whether the 
world can ever safely stop vaccinating against 
polio, the fundamental assumption on which 
polio eradication was sold some 20 years ago 
(see sidebar, p. 705). And perhaps most of all, 
they are hoping for a little luck. 


Stalemate 

For a long time, it looked like the polio war- 
riors had the virus licked. Soon after the pro- 
gram was launched in 1988, with the confi- 
dent prediction that polio would be gone by 
2000, the program began dispatching virus 
from more than 100 countries in quick succes- 
sion, using the tried-and-true approach that 
had already eradicated polio in the Western 
Hemisphere: supplementing routine polio 
immunizations with huge countrywide cam- 


paigns several times a year to deliver drops of 
Albert Sabin’s oral polio vaccine (OPV) to 
every child under age 5. 

By 2000, global cases had fallen 99% from 
350,000 to 791, reaching an all-time low of 
483 in 2001. In the process, one of the three 
wild poliovirus serotypes, type 2, was eradi- 
cated almost inadvertently—providing a proof 
of concept that the ambitious plan was indeed 
feasible. By 2006, type | and type 3 virus were 
cornered in just four “endemic” countries— 
India, Nigeria, Afghanistan, and Pakistan— 
where transmission has never been interrupted 
(Science, 26 March 2004, p. 1960). 

But there the initiative stalled, with the 
four endemic countries periodically erupting 
and reinfecting other polio-free countries 
and the global case count hovering between 
about 1000 and 2000. 

As skepticism mounted among 
scientists and weary donors, a 
few advocated throwing in the 
towel on eradication and concen- 
trating instead on keeping the 
virus in check (Science, 12 May 
2006, p. 832). Meanwhile, Bruce 
Aylward, the peripatetic and 
unfailingly optimistic M.D./MPH 
who has led the effort since 1998, 
kept insisting that success was 
just around the corner—just 
another year away. 

That was the context in which 
Chan launched the intensified 
program, pouring in more money 
and resources to determine once and for all 
whether eradication could be achieved. The 
answer, everyone agreed, depended on 
progress in the four endemic countries. 


India seesaws 

In India, more than in any other country, the 
polio fighters were banking on a win in 
2007-08. The plan was to deal a “mortal 
blow” to poliovirus type 1, considered the 
worst player because it causes more paralytic 
disease and spreads faster than the other 
remaining wild virus, type 3. Once type | was 
dispatched, mopping up type 3 in India would 
be easy, they predicted, and would show the 
world the program was on track. 

“Tf we can do it in India, the toughest place 
in the world, we can do it anywhere,” says 
Aylward. Early on, polio experts realized 
India was different; instead of the three to four 
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doses of OPV that had sufficed to stop 
poliovirus transmission elsewhere, in some 
parts of northern India, children needed eight 
or 10 doses to be protected—and still some 
became paralyzed. In the other endemic coun- 
tries, GPEI says the problem is “a failure to 
vaccinate”; in India, by contrast, the problem 
is compounded by “a failure of the vaccine.” 
The country’s huge population, explosive 
birthrate, and crowded, squalid conditions 
combine to create an ideal environment for the 
virus, which is transmitted by feces. 

To tackle this “pernicious transmission,” 
as Aylward calls it, in early 2005 GPEI helped 
rush into use a new, more immunogenic ver- 
sion of OPV—a monovalent form that 
focused all its firepower on just type 1 
(mOPV1). Later that year, mOPV3 was intro- 
duced (Science, 14 January 2005, p. 190). 
With the new vaccines in hand, northern India 
launched its sequential strategy, vowing to 
wipe out type | by the end of 2008. 

Volunteers flooded the country with oral 
polio drops, upping rounds from every 
2 months to every 4 weeks and focusing on the 
toughest districts in the crowded, impover- 
ished states of Uttar Pradesh, outside Delhi, 
where circulation was most intense, and 
Bihar, some 800 kilometers east. 

The results were stunning. Type 1 cases 
across the country dropped from 648 in 2006 
to 73 in 2008. Most remarkable, Uttar 
Pradesh, which Aylward calls the wellspring 
of polio in the country—“every virus in India 
since 1999 has been linked to that area,” he 
says—went 12 months without a case. “It is 
really a hallmark [achievement],” says 
Samuel Katz, a polio expert at Duke Univer- 
sity in Durham, North Carolina, who directs 
GPEI’s newly reconstituted research advi- 
sory committee. 

But in early 2008, India was blindsided by 
a walloping epidemic of type 3 polio. 
Although the program had continued to use 
occasional rounds of mOPV3 and the trivalent 
formulation, tOPV, to forestall just such an 
event, “we didn’t get the balance right,” con- 
cedes David Heymann, who oversees the 
polio program as WHO’ assistant director- 
general for infectious diseases. 

Then in June 2008, type 1 came back to 
Uttar Pradesh. Genetic analyses showed it 
wasn’t a local Uttar Pradesh virus, with its 
distinct genetic signature—instead, it was 
“imported” from Bihar. To scientists, the 
distinction was important—it meant trans- 
mission in Uttar Pradesh had indeed stopped 
for the first time ever—but that still left the 
country battling an epidemic on two fronts, 
with cases in 2008 down from 2007 but still, 
at 556, alarmingly high. 
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At a November 2008 meeting, the India 
Expert Advisory Group, which oversees the 
country’s effort, vowed to continue the fight 
into 2009, again focusing on type | but 
adding more doses of mOPV3 to keep that 
serotype in check. As a contingency plan, 
WHO and partners are testing a higher 
potency mOPV1, and the country is explor- 
ing whether adding doses of inactivated 
polio vaccine can help boost immunity in 
young children. 

All bets are still on India to be the first of 
the four endemic countries to stop transmis- 
sion of the wild virus. India is the “key to 
donor confidence,” says Heymann. “We 
need a victory.” 


Nightmare in Nigeria 

In contrast, few expected much progress in 
northern Nigeria, where opposition, apathy, 
political instability, and corruption have 
stymied the program for years. But even 
realists didn’t necessarily expect an out- 
break of the magnitude that struck in 2008, 
in which cases in some areas were 10 times 
higher than in 2007. “Polio in Nigeria 
remains a nightmare,” says Oyewale Tomori 
of Redeemer’s University near Lagos, head 
of the country’s expert polio advisory group. 

In May 2008, WHO issued a blunt warn- 
ing that Nigeria posed a risk to the rest of the 
world, threatening to derail the entire global 
effort. It came close to doing that in 
2003-04, when suspicions about vaccine 
safety led several Muslim states in northern 
Nigeria to stop all polio vaccination for up to 
a year (Science, 2 July 2004, p. 24). Asa 
result, virus from Nigeria reinfected 20 pre- 
viously polio-free countries, as far away as 
Indonesia. 

Rumors about vaccine contamination are 
no longer the major imped- 
iment to eradication; 
instead, Nigeria’s problems 
are largely “operational,” 
say GPE] officials, citing a 
lack of political will and the 
government’s failure to pro- 
vide even the most rudi- 
mentary health services. 
Others more bluntly refer to 
“gross incompetence” and 
say graft and corruption figure heavily. As 
Tomori explains, in the past, vaccinators 
might have been promised 40 Nigerian 
nairas a day for their work, but by the time 
government officials skimmed off their 
share, each may have received about 4. 
Those problems have now been fixed, say 
GPEI officials. 

At the epicenter of the epidemic in Kano 


state, 68% of all children have received 
fewer than three doses of OPV, and up to 
30% are “zero-dose.” With 791 cases in 
2008, Nigeria accounted for almost 50% of 
the global total. That number is especially 
frustrating to polio experts because stopping 
transmission in Nigeria should be a cinch 
compared with India. Studies by Nicholas 
Grassly and colleagues at Imperial College 
London have shown that vaccine efficacy is 
high there; that means that transmission of 
the virus should stop when population 
immunity reaches roughly 80%. 

Now, as in 2003, WHO and world leaders 
are trying to shame Nigeria into action. Last 
May, the World Health Assembly passed a res- 
olution singling out Nigeria and calling on the 
country to quickly stop its runaway outbreak. 

The public humiliation may be having 
the desired effect. In July, President Umaru 
Yar’ Adua vowed to redouble the effort. The 
ineffective head of the national polio pro- 
gram has been replaced, the third such 
change in 3 years. “This time it is different; 
the president is on board,” asserts Aylward. 
Tomori is more circumspect, saying he will 
wait to see whether this high-level commit- 
ment actually translates to action on the 
ground. Meanwhile, in 2008, poliovirus 
from Nigeria spread to seven West African 
countries. Other war-torn countries, includ- 
ing Chad and Sudan, are still grappling with 
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Mixed bag. Despite significant advances, polio cases 
in 2008 remained high in the four endemic countries. 


epidemics sparked by earlier “importations” 
from Nigeria. 


Perils in Afghanistan and Pakistan 

In the other two hot spots, Afghanistan and 
Pakistan, violence, political turmoil, religious 
opposition, and the fierce autonomy of local 
leaders render eradication all but impossible. 
Large swaths of both countries are “no-go” 
zones where WHO and other United Nations 
personnel are not allowed to operate. National 
polio teams can still get in but are justifiably 
leery of doing so. Even in “accessible” areas, 
a “climate of fear” prevails, and vaccination 
teams may report going more often than they 
actually do, says epidemiologist Rudi Tanger- 
mann, who oversees efforts in the two coun- 
tries from Geneva. 

In mid-2007, GPEI thought it had made 
significant headway; a “third party” had 
brokered an agreement with the Taliban to 
let polio vaccinators work unimpeded. 
Despite that agreement, in March 2008, two 
polio workers and their driver were killed by 
a suicide bomber in southern Afghanistan, 
where they were traveling to prepare for a 
vaccination campaign. 

Surveillance and monitoring are compro- 
mised as well. “We are peering in from the 
outside,” concedes Aylward. The countries 
constitute one epidemiologic block, with 
two transmission zones where the wild virus 
travels freely across the border. One is in 
Pakistan’s rugged and inhospitable North- 
West Frontier Province and the federally 
administered tribal areas, where the Taliban 
and Al Qaeda are resurgent. The second 
extends from southern Afghanistan, near 
Kandahar, through 
Baluchistan, and then 
stretches all the way 
to northern Sindh in 
central Pakistan. 

Ofthe two, Pakistan 
is the bigger worry, 
says Tangermann. The 
number of cases rose in 
Afghanistan in 2008, 
but nowhere near as 
high as they did in Pakistan, where type 1 
exploded and the virus spread into previ- 
ously polio-free areas. In Afghanistan, 
President Hamid Karzai has pledged his 
support for eradication; Pakistan, on the 
other hand, “must become more committed 
under its new government,” says Heymann. 

Fundamentalist leaders in Pakistan have 
issued fatwahs saying the vaccine is unsafe 
and threatening vaccinators. “Refusals” have 
risen considerably. In February 2007, a Pak- 
istani doctor and his driver were killed by a 
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RETHINKING THE POLIO ENDGAME 


One of the toughest conundrums for the long-running Global Polio Eradica- 
tion Initiative has been whether and when it would be safe to stop vaccinating 
once they deem the virus gone. Now, the thinking is undergoing a major shift. 

The first big complication came in 1999 when scientists realized that the 
weakened virus used in the live oral polio vaccine (OPV) could revert to its 
neurovirulent form in rare cases and spark an epidemic. Thus was born the 
“OPV paradox”: OPV was necessary to eradicate the virus, but as long as 
OPV was in use, eradication could never be achieved. As a solution, World 
Health Organization (WHO) scientists proposed a 
plan: After the world was certified polio-free, all 
countries would stop using OPV simultaneously, as 
if at the stroke of midnight. 

Some scientists dismissed the idea as folly and 
instead advocated universal use of the inactivated 
polio vaccine (IPV), already widely used in devel- 
oped countries. That would be the only way to 
ensure the world was really safe, they argued, and 
the only way to prevent a gross inequity in which poor countries bore all 
the risk of polio. 

For years, WHO maintained that such a switch wasn’t feasible: IPV was too 
expensive for poor countries, it must be injected, and its effectiveness is 
unproven in tropical settings. Now, experts such as Roland Sutter, who heads 
WHO research in Geneva, Switzerland, concede that IPV does have a role after 
all. “The world will be a much safer place if more countries use IPV,” he says. 

To make that possible, WHO is now dusting off some earlier studies 


“The world will be a much safer 
place if more countries use 
[inactivated polio vaccine]” 
when the virus is deemed gone. 
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sion of IPV. WHO is looking at “dose sparing” strategies that could bring 
down the cost. In Cuba and Oman, it is testing the efficacy of using one- 
fifth the normal dose, delivered intradermally with an injection gun 
instead of intramuscularly with a needle. Other projects are trying to 
“stretch” the antigen with new adjuvants. 

Another big push is for what is called a “Sabin IPV.” One of the draw- 
backs of the standard Salk IPV is that production starts with the dangerous 
wild virus, which is then killed. To reduce the chances of an accidental 
release, IPV is manufactured only in facilities that operate under strict bio- 
containment procedures and only in countries that maintain a very high 
population immunity against polio. Both requirements rule out transferring 
the technology to developing-country manufactur- 
ers, which would bring down the cost. 

A Sabin IPV would use the less infectious atten- 
uated strain from the oral vaccine as its seed stock, 
providing “a margin of safety” should an accident 
occur, says Sutter. Several clinical trials of a Sabin 
IPV are ongoing; if all goes well, it could be intro- 
—ROLAND SUTTER duced within 5 to 8 years. 

Ultimately, says Bruce Aylward, who runs 
WHO's eradication initiative, “| want something much better than Sabin 
IPV.” Several groups are working on manipulating the virus to make safer 
seed strains that could be handled under less stringent safeguards. It’s still 
early days, but there are several promising leads, including a virus that can’t 
survive at body temperature. 

Sutter and Aylward say each country will decide whether to continue 
vaccinating. But if countries do choose to continue, they want to have in 
place a “cost neutral” vaccine that delivers the same protection as OPV at 


and investing heavily in new research into a cheaper, more effective ver- 


remote bomb while they were returning from 
a village where they were trying to persuade 
parents to let their children be vaccinated. 

Equally unsettling, the Geneva team sus- 
pects that the program in Pakistan is weaker 
than they imagined and that the viral foe may 
be tougher. Earlier reports that vaccination 
teams reached 95% of the target children seem 
to have been fabricated, says Tangermann. 
And recent studies by Grassly and colleagues 
at Imperial suggest that viral transmission is 
much more efficient in Pakistan than previ- 
ously believed, closer to that of India than that 
of Nigeria. “Pakistan is the only place we 
really have questions about what we are deal- 
ing with,” says Aylward. 

To get a better fix on the biology, WHO 
and Pakistani partners are planning studies to 
measure antibodies to the virus in children in 
Karachi, Peshawar, and Lahore. “We have to 
see how effective the vaccine is and how well 
the program is working,” says Aylward. 

On the political front, Aylward has been 
trying to work his magic. At a high-level 
meeting last December in Islamabad, he and 
other partners got assurances from Minister 
of Health Mir Aijaz Hussain Jakhrani that 
Pakistan would make eradication a priority. 

For now, says Heymann, the most the pro- 


gram can hope to achieve there is to show it 
can stop transmission in conflict-free regions, 
like Punjab, where despite repeated cam- 
paigns, circulation remains intense. For the 
other areas, they wait. “We may be quite slow 
in areas with security problems,” he says. 


Vote of confidence 

Despite the upsurge of cases in 2008, Aylward 
insists that the world is much closer to eradica- 
tion than it was a year ago. Chan has declared 
polio eradication WHO’s “top operational pri- 
ority,’ saying in a speech in June, “The credi- 
bility of not just WHO but of many other 
health initiatives is on the line.” She is organ- 
izing an independent review to figure out 
what went wrong in each country and what the 
program could do better. 

The global oversight body, the Advisory 
Committee on Polio Eradication, is on board 
as well; in December, it endorsed GPEI’s 
strategic plan for 2009 to 2013. Although Ayl- 
ward is leery of firm deadlines, the plan calls 
for interrupting type | transmission in India 
by the end of 2009 and type 3 the following 
year. They hope to wipe out both types in 
Afghanistan and Pakistan in 2010, but Nigeria 
might take a year longer. All that depends, of 
course, on donors keeping their checkbooks 


the same price—without the risk. 
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open and countries putting their mind and 
muscle behind eradication. 

The donors have stepped up. With the 
$635 million infusion from Gates and others, 
funding looks better than it has in years. The 
$255 million Gates grant is the largest single 
donation since Rotary International kicked 
off the effort in 1988 with $240 million. 

Aylward keeps up his unrelenting sched- 
ule, visiting endemic and reinfected countries 
to spur or prod them into action. For the tough- 
est spots, the big guns go too, such as Chan, 
Heymann, and the newest advocate, Bill 
Gates, who is also championing malaria erad- 
ication and who visited India in December 
2008 and Nigeria just last week. 

For now, the global health community 
seems willing to give the eradication initiative 
more time. There are still skeptics who say it 
will never be finished. But most agree with 
Wright. “It’s terribly hard,” he says. “All the 
models suggest it is not a good idea to give up 
on the program.” 

“We won’t let up,” said Aylward in an inter- 
view from the noisy Brazzaville airport in 
Congo en route to Islamabad. “I will person- 
ally push it over the line if I have to. We still 
have very long sleeves and lots of tricks up 
them if we need them.” -LESLIE ROBERTS 
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RETHINKING THE POLIO ENDGAME 


One of the toughest conundrums for the long-running Global Polio Eradica- 
tion Initiative has been whether and when it would be safe to stop vaccinating 
once they deem the virus gone. Now, the thinking is undergoing a major shift. 

The first big complication came in 1999 when scientists realized that the 
weakened virus used in the live oral polio vaccine (OPV) could revert to its 
neurovirulent form in rare cases and spark an epidemic. Thus was born the 
“OPV paradox”: OPV was necessary to eradicate the virus, but as long as 
OPV was in use, eradication could never be achieved. As a solution, World 
Health Organization (WHO) scientists proposed a 
plan: After the world was certified polio-free, all 
countries would stop using OPV simultaneously, as 
if at the stroke of midnight. 

Some scientists dismissed the idea as folly and 
instead advocated universal use of the inactivated 
polio vaccine (IPV), already widely used in devel- 
oped countries. That would be the only way to 
ensure the world was really safe, they argued, and 
the only way to prevent a gross inequity in which poor countries bore all 
the risk of polio. 

For years, WHO maintained that such a switch wasn’t feasible: IPV was too 
expensive for poor countries, it must be injected, and its effectiveness is 
unproven in tropical settings. Now, experts such as Roland Sutter, who heads 
WHO research in Geneva, Switzerland, concede that IPV does have a role after 
all. “The world will be a much safer place if more countries use IPV,” he says. 

To make that possible, WHO is now dusting off some earlier studies 


“The world will be a much safer 
place if more countries use 
[inactivated polio vaccine]” 
when the virus is deemed gone. 
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sion of IPV. WHO is looking at “dose sparing” strategies that could bring 
down the cost. In Cuba and Oman, it is testing the efficacy of using one- 
fifth the normal dose, delivered intradermally with an injection gun 
instead of intramuscularly with a needle. Other projects are trying to 
“stretch” the antigen with new adjuvants. 

Another big push is for what is called a “Sabin IPV.” One of the draw- 
backs of the standard Salk IPV is that production starts with the dangerous 
wild virus, which is then killed. To reduce the chances of an accidental 
release, IPV is manufactured only in facilities that operate under strict bio- 
containment procedures and only in countries that maintain a very high 
population immunity against polio. Both requirements rule out transferring 
the technology to developing-country manufactur- 
ers, which would bring down the cost. 

A Sabin IPV would use the less infectious atten- 
uated strain from the oral vaccine as its seed stock, 
providing “a margin of safety” should an accident 
occur, says Sutter. Several clinical trials of a Sabin 
IPV are ongoing; if all goes well, it could be intro- 
—ROLAND SUTTER duced within 5 to 8 years. 

Ultimately, says Bruce Aylward, who runs 
WHO's eradication initiative, “| want something much better than Sabin 
IPV.” Several groups are working on manipulating the virus to make safer 
seed strains that could be handled under less stringent safeguards. It’s still 
early days, but there are several promising leads, including a virus that can’t 
survive at body temperature. 

Sutter and Aylward say each country will decide whether to continue 
vaccinating. But if countries do choose to continue, they want to have in 
place a “cost neutral” vaccine that delivers the same protection as OPV at 


and investing heavily in new research into a cheaper, more effective ver- 


remote bomb while they were returning from 
a village where they were trying to persuade 
parents to let their children be vaccinated. 

Equally unsettling, the Geneva team sus- 
pects that the program in Pakistan is weaker 
than they imagined and that the viral foe may 
be tougher. Earlier reports that vaccination 
teams reached 95% of the target children seem 
to have been fabricated, says Tangermann. 
And recent studies by Grassly and colleagues 
at Imperial suggest that viral transmission is 
much more efficient in Pakistan than previ- 
ously believed, closer to that of India than that 
of Nigeria. “Pakistan is the only place we 
really have questions about what we are deal- 
ing with,” says Aylward. 

To get a better fix on the biology, WHO 
and Pakistani partners are planning studies to 
measure antibodies to the virus in children in 
Karachi, Peshawar, and Lahore. “We have to 
see how effective the vaccine is and how well 
the program is working,” says Aylward. 

On the political front, Aylward has been 
trying to work his magic. At a high-level 
meeting last December in Islamabad, he and 
other partners got assurances from Minister 
of Health Mir Aijaz Hussain Jakhrani that 
Pakistan would make eradication a priority. 

For now, says Heymann, the most the pro- 


gram can hope to achieve there is to show it 
can stop transmission in conflict-free regions, 
like Punjab, where despite repeated cam- 
paigns, circulation remains intense. For the 
other areas, they wait. “We may be quite slow 
in areas with security problems,” he says. 


Vote of confidence 

Despite the upsurge of cases in 2008, Aylward 
insists that the world is much closer to eradica- 
tion than it was a year ago. Chan has declared 
polio eradication WHO’s “top operational pri- 
ority,’ saying in a speech in June, “The credi- 
bility of not just WHO but of many other 
health initiatives is on the line.” She is organ- 
izing an independent review to figure out 
what went wrong in each country and what the 
program could do better. 

The global oversight body, the Advisory 
Committee on Polio Eradication, is on board 
as well; in December, it endorsed GPEI’s 
strategic plan for 2009 to 2013. Although Ayl- 
ward is leery of firm deadlines, the plan calls 
for interrupting type | transmission in India 
by the end of 2009 and type 3 the following 
year. They hope to wipe out both types in 
Afghanistan and Pakistan in 2010, but Nigeria 
might take a year longer. All that depends, of 
course, on donors keeping their checkbooks 


the same price—without the risk. 


-L.R. 


open and countries putting their mind and 
muscle behind eradication. 

The donors have stepped up. With the 
$635 million infusion from Gates and others, 
funding looks better than it has in years. The 
$255 million Gates grant is the largest single 
donation since Rotary International kicked 
off the effort in 1988 with $240 million. 

Aylward keeps up his unrelenting sched- 
ule, visiting endemic and reinfected countries 
to spur or prod them into action. For the tough- 
est spots, the big guns go too, such as Chan, 
Heymann, and the newest advocate, Bill 
Gates, who is also championing malaria erad- 
ication and who visited India in December 
2008 and Nigeria just last week. 

For now, the global health community 
seems willing to give the eradication initiative 
more time. There are still skeptics who say it 
will never be finished. But most agree with 
Wright. “It’s terribly hard,” he says. “All the 
models suggest it is not a good idea to give up 
on the program.” 

“We won’t let up,” said Aylward in an inter- 
view from the noisy Brazzaville airport in 
Congo en route to Islamabad. “I will person- 
ally push it over the line if I have to. We still 
have very long sleeves and lots of tricks up 
them if we need them.” -LESLIE ROBERTS 
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EVOLUTIONARY BIOLOGY 


Agreeing to Disagree 


Two friends debate the relative importance of kinship in the 
evolution of complex social systems in insects 


When Bert H6lldobler and Edward O. Wilson 
wrote their first book, The Ants, it was a 
labor of love. Passionate about the insects 
they studied, they produced a Pulitzer 
Prize-winning work that brought worldwide 
attention to the sophisticated social world of 
ants. But when they sat down to write the 
sequel more than a dozen years later, the 
honeymoon was over. 

For these two biologists, ants, bees, 
wasps, and termites hold a special place in 
nature, having achieved the ultimate in altru- 
istic behavior. In these so-called eusocial 
species, one or a few females produce young 
that other individuals in the colony care for 
in lieu of producing their own offspring. 
Twenty years ago, Wilson, based at Harvard 
University, and Hélldobler, now based 


Heave ho. Weaver ants 
work together to build 
nests for colonies with 
millions of members. 


at Arizona State 
University, Tempe, 
attributed this seem- 
ingly self-sacrific- 
ing behavior to the 
unusually close kin- 
ship among colony 
members. 

Yet, over the past 
4 years, the duo has 
struggled to recon- 
cile new informa- 
tion indicating that 
some social insects 
are exceptions to the 
rules that guided 
their thinking when 
they wrote The Ants. 
Although their new 
book, The Super- 
organism, tackles 
the evolution of 
eusociality, the two 
researchers are at 
odds over how these 
social systems got 
started. 

“We agree on 
everything except 
what happens on 
that one magic step,” 
Wilson says. He no 
longer argues that 
kinship is the under- 
lying driving force, 
suggesting instead 
that what tips the 
scales in favor of 
eusociality is a particular set of environmental 
conditions that leads to the guarding of the 
young in nests close to dependable food 
sources. Over the past 2 years, he has pushed 
hard to get the field to see his new perspec- 
tive. And Hélldobler has pushed back. 
“There is nothing wrong with kin selection,” 
H6lldobler insists. “It’s a very important 
concept to understanding the early evolution 
of eusociality—how it begins.” 

Others wonder what the fuss is about. 
“It’s a tempest in a teapot,” says Hudson 
Kern Reeve, an evolutionary biologist 
at Cornell University. “It’s distracting us 
from the really interesting questions.” But 
Wilson disagrees: “If we can’t understand 
the beginning of eusociality, our whole 
view of it is wobbly.” 


Living large 

Eusociality represents the ultimate in suc- 
cessful living, say Hélldobler and Wilson. 
Summed together, the 20,000 species of ants, 
bees, wasps, and termites represent two- 
thirds of the insect biomass, even though they 
account for just 2% of the insect species. Ants 
in tropical rainforests outweigh all the verte- 
brate inhabitants combined. (All ants and ter- 
mites are eusocial; some bees and wasps are.) 

Their colonies, nests, and hives actually 
represent “superorganisms,” a concept intro- 
duced 80 years ago by William Morton 
Wheeler and reemphasized by Hélldobler 
and Wilson in their new book. Many organ- 
isms form groups, but in the superorganism, 
the group operates with a minimum of inter- 
nal strife; most group members work toward 
the common good with little apparent mind 
to individual self-interest. 

That attitude had Charles Darwin flum- 
moxed. He called social insects a “special 
difficulty” capable of toppling his theory of 
evolution. Natural selection should favor 
individuals that always put themselves—and 
the propagation of their own genes—first. Yet 
that didn’t seem to be happening with worker 
bees or soldier ants, which often have no 
reproductive capability at all and instead 
strive to keep their queen and her offspring 
well-fed and protected. Darwin suspected 
that family ties might explain this apparent 
paradox—with selection operating at the 
family and not the individual level. 

In the 1960s, William Hamilton, a British 
biologist building on work by a fellow country- 
man, geneticist J. B. S. Haldane, put forth a 
very convincing theory, often called kin 
selection, that explained eusociality: Such 
altruism could evolve when a queen propa- 
gated enough of her workers’ genes to make 
it worth the workers’ while to help her rather 
than reproduce independently. 

In the 1970s, Wilson became one of kin 
selection’s greatest proponents. Textbooks 
embraced the idea, and theoreticians worked 
hard to refine their understanding of how kin 
selection worked. “It became dogma,” says 
Wilson. Yet when Wilson first started to put 
together the new book’s chapter discussing 
the origin of eusociality, he ran into trouble. 
“In the last few years, the picture had 
changed considerably,” he says. Since the 
publication of The Ants, field biologists had 
discovered many variations on the eusocial- 
ity theme. Eusocial termites, for example, do 
not have the so-called haploid-diploid 
genetic system that was critical to Hamilton’s 
explanation of kin selection, and some 
species that do have those genetics haven’t 
evolved eusociality. 
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Moreover, it became clear “there are 
counterforces that oppose close-pedigree kin- 
ship in the early evolution of social insects,” 
Wilson points out. Genetic diversity leads to 
greater resistance to disease and other destruc- 
tive forces, and outbreeding does occur in 
these systems. He cites the example of primi- 
tive wasps, in which individuals in the same 
colony tend to become less related as time 
passes. He reasons that if close relatedness 
were so important, then these groups should 
continue to be made up of close kin. “Yet 
counterforces against the favoring of close kin 
in the origin of eusociality have not been taken 
into account by the theoreticians,” he criticizes. 

In 2004, Wilson wrote a manuscript on his 
new view and sent it to a half-dozen col- 
leagues, working with them one 
on one to refine his ideas. At the 
time, Hélldobler was on board, 
and together they co-authored a 
perspective in the 20 September 
2005 Proceedings of the National 
Academy of Sciences. They 
pointed out that three forces were 
at work in shaping an insect soci- 
ety: group, individual, and “col- 
lateral” kin selection (involving 
relatives other than offspring). 
Group selection occurs when 
one cooperating hive or nest out- 
competes another one; individ- 
ual selection involves the sur- 
vival and reproductive output of 
a particular ant, wasp, or bee; 
and kin selection has an impact 
when relatives other than off- 
spring help spread shared genes. 

In that paper, Wilson and 
H6lldobler analyzed the condi- 
tions under which each type of 
selection tended to enhance or 
undermine cooperation, and how 
strongly. They concluded that the 
degree of relatedness primarily 
affects how quickly a social sys- 
tem changes but not whether 
cooperation was favored in the 
first place. ““Eusociality ... can, 
in theory at least, be initiated by 
group selection in either the 
presence or absence of close 
relatedness,” they wrote. But it 
could not arise without group selection. 

What was important was not relatedness, 
they argued, but the evolution ofa gene variant 
that caused offspring to linger at the nest 
rather than disperse and an environment that 
favored cooperation. Researchers have yet to 
identify such an allele, but several researchers 
are looking, particularly in species that some- 


times cooperate and sometimes live solitary 
lives, depending on the circumstances. “The 
preadaptation for eusociality will always 
[involve] a family and in that family, the indi- 
viduals will be closely related,’ notes Wilson. 
“But that’s very different from saying that 
close relatedness brought them to take that last 
step [into eusociality].” 

As evidence that kinship is not critical, they 
cited examples of species that appear to be on 
the first steps toward eusociality, such as xylo- 
copine carpenter bees. These bees form tem- 
porary semisocial groups that begin when one 
member of a pair bullies the other into becom- 
ing a worker while the first becomes the queen. 
Workers that are related to the queen tend to 
leave, but nonrelatives tend to stick around, 


Conflict resolution. Holldobler (top, left) and Wilson debated 
how to cover the origins of eusociality in their new book, which 
discusses social insects including termites. In dampwood 
termites (bottom), workers and nymphs feast on a queen killed 
when two colonies converged. 


suggesting that kinship has little to do with the 
cohesiveness of the group. Likewise, primi- 
tively eusocial wasps will set up group living 
arrangements with either kin or nonkin. 
Wilson and Hdlldobler also argued that 
because eusociality is quite rare—these altruis- 
tic societies have emerged only about a dozen 
times and exist in just 15 out of the 2600 known 
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insect and arthropod families—it takes just the 
right combination of factors to set the stage for 
it. For insects, the appropriate setting involves 
maternal behavior in which the female builds a 
nest, gathers food, and feeds the young. In that 
situation, it helps to have someone guarding the 
young while the queen is foraging. 

Moreover, if the environment has ample 
food but a lot of competition, “that kind of pres- 
sure would make a group superior over an indi- 
vidual,” says Wilson. Consider primitive ter- 
mites that nest in dead trees in California. Many 
king-queen pairs settle into a tree, but eventu- 
ally one colony will take over the entire trunk, 
depending on how strong a cooperative unit 
that pair establishes. “It’s solid group selec- 
tion,” says Wilson. 


The rift 

Wilson continued to expand on these ideas. In 
the January 2008 issue of BioScience, he reit- 
erated that collateral kin selection does not 
play a significant role and reemphasized that 
the road to eusociality entails setting up coop- 
erative housekeeping and foraging from a 
local, dependable food source. 

The numbers support his view, he says. No 
eusocial species exist among the 70,000 
species of parasitoid wasps and their relatives 
that travel from prey to prey to lay their eggs; 
whereas it arose seven times in the 55,000 
aculeate (stinging) wasps that keep paralyzed 
prey in nests. Among the 9000 species of 
aphids and thrips, only those few that induce 
plant hosts to form galls—a nest of sorts— 
have become eusocial even though most of 
them form groups. And out of 10,000 decapod 
crustaceans, only snapping shrimp, which live 
in sponges, are eusocial, he reported. 

In these settings, species with the flexibil- 
ity to live in groups can win out. Wilson 
thinks today’s eusocial species likely had 
ancestors similar to a Japanese stem-nesting 
xylocopine bee, Ceratina flavipes: Most of 
the time females make do on their own, but 
every once in a while, they pair up and divide 
the labor, setting the stage for a “group” to 
outdo individuals and for the tendency to 
form groups to be favored. 

And, he argues, it may not take many 
genes to get a group to form. Studies of bac- 
teria and fungi show that a single gene can 
have a profound effect on dispersal. A loss of 
the dispersal instinct positions individuals to 
become helpers. 

About this time, Wilson began collaborat- 
ing closely with David Sloan Wilson of Bing- 
hamton University in New York. After organ- 
izing a discussion group with about 20 others, 
they published an expanded explanation of 
these ideas in the December 2007 issue of 
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The Quarterly Review of Biology and in the 
September-October 2008 issue of American 
Scientist. Kin-selection theory, they argue, is 
ineffective. “The theory that traditionalists use 
leads them anywhere they want to go” and fails 
to make useful predictions, Wilson asserts. “To 
make [a theory] really stand [up], you have to 
show that that’s the only result that can come 
from your theory, and they haven’t done that.” 

Meanwhile, Hélldobler became convinced 
that kin selection really is critical 
for the origin of eusociality after 
all. He became increasingly 
unhappy with what Wilson and he 
had written in 2005, concluding 
that they had not made a clear 
enough distinction between coop- 
eration and eusociality and that 
their evidence included suppos- 
edly primitively eusocial species 
that were really more advanced 
than that. “You don’t need to be 
closely related for the evolution of 
cooperative groups, but eusocial- 
ity is very special,” he notes. 

“Reeve and I set the record 
straight,” he says, in the 5 June 
2007 issue of the Proceedings of 
the National Academy of Sciences. 
They emphasized that because, 
mathematically, group and kin 
selection were equivalent, the 
argument that group rather than 
kin selection lay at the heart of the 
origin of eusociality was spurious. 
“The fuss about kin selection ver- 
sus group selection is quite point- 
less,’ says Hélldobler. 

He sees the evolution of euso- 
ciality as a two-step process acti- 
vated by ecological conditions 
and operating on closely related 
individuals. First, a subsocial 
species must work out an arrange- 
ment in which offspring that are 
capable of reproducing nonethe- 
less help the mother raise siblings. 
Next, that arrangement becomes 
cast in stone, with sterile workers. 

In primitive ants, workers are 
almost the size of the queen, 
still have functional ovaries, and 
can mate, but as long as the 
mother remains fertile, they usu- 
ally refrain from laying eggs. 
Evolution to eusociality often 
stalls at this first primitive stage, 
Hdlldobler notes. Colonies can 
never get bigger than what the 
queen and worker nestmates can 
police because individuals might 


otherwise “cheat” the system and reproduce 
on their own. 

But under certain circumstances, bigger, 
more harmonious groups outdo rivals and 
evolve full-fledged eusocial networks, in 
which millions of sterile specialized individ- 
uals work together as a superorganism. In 
these cases, competition between groups 
favors cooperation within a group, and com- 
petition between individuals in the group dis- 


Promising baainntiias At their nests, a female paper wasp (top) and a female 
sweat bee set up shop where early offspring may become workers that help raise 
the next generation. 


sipates. “At that point, it’s not important to 
have high relatedness,” says Hélldobler. And, 
as he and Wilson agree, it can be healthier for 
a colony to have genetic diversity. 
H6lldobler’s opinion seems to hold the 
majority. “On this one issue, it is largely 
Wilson on one side and most of the rest of 
us on the other,” says David Queller, an evo- 
lutionary biologist at Rice University in 
Houston, Texas. “Like many researchers, I 
have become convinced that 
variation in costs and benefits is 
probably more important than 
variation in relatedness favored 
by Hamilton, but relatedness is 
still an essential part of it, and it 
all fits into Hamilton’s kin- 
selection paradigm.” 
Yet there are dissenters other 
than Wilson and Wilson. “Relat- 
edness is not needed to 
explain the origin of 
worker behavior,” says 
James Hunt, a biologist 
who studies wasps at North 
Carolina State University in 
Raleigh. “A mountain of data [has 
shown] that neither high related- 
ness nor asymmetry of related- 
ness among nestmates is neces- 
sary for the origin of sociality.” 
This rift slowed progress on 
The Superorganism. Originally, 
Wilson was supposed to write 
the chapter on the genetics and 
evolution of eusociality, but after 
5 years, it was still not done, with 
the two authors at odds about 
how to proceed. Finally in July 
2007, Wilson handed his pen 
over to Hoélldobler, who labored 
over a version that Wilson even- 
tually said he could live with as 
long as there were references to 
Wilson’s perspective. The chapter 
“represents the best compromise 
that Bert and I could reach,” says 
Wilson. And now, on the book- 
signing circuit for their new 
book, Wilson and Holldobler 
downplay their points of depar- 
ture. “We were worried that that 
' “4 chapter would distract the reader 
VF from the wonderful natural his- 
tory,” says Hélldobler. So they 
beg off questions related to the 
controversy. 
And they remain fast friends. 
In the end, says Wilson, “we 
agreed to disagree.” 
—-ELIZABETH PENNISI 
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On the Origin of 
Art and Symbolism 


Since their discovery by French spelunkers 
in 1994, the magnificent lions, horses, and 
rhinos that seem to leap from the walls of 
Chauvet Cave in southern France have 
reigned as the world’s oldest cave paint- 
ings. Expertly composed in red ochre and 
black charcoal, the vivid drawings demon- 
strate that the artistic gift stretches back 
more than 30,000 years. These paintings 
are almost sure to be mentioned in any arti- 
cle or paper about the earliest art. But what 
do they really tell us about the origins of 
artistic expression? 

The prehistoric humans who decorated 
Chauvet’s walls by torchlight arrived at the 
cave with their artistic genius already in full 
flower. And so, most researchers agree that 
the origins of art cannot simply 
be pegged to the latest discovery 
of ancient paintings or sculp- 
ture. Some of the earliest art 
likely perished over the ages; 
much remains to be found; and 
archaeologists don’t always 
agree on how to interpret what is 
unearthed. As a result, instead of 
chasing after art’s first appear- 
ance, many researchers seek to 
understand its symbolic roots. 
After all, art is an aesthetic 
expression of something more 
fundamental: the cognitive abil- 
ity to construct symbols that 
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communicate meaning, whether they be the 
words that make up our languages, the musi- 
cal sounds that convey emotion, or the dra- 
matic paintings that, 30,000 years after their 
creation, caused the discoverers of the Chau- 
vet Cave to break down in tears. 

While sites like Chauvet might be vivid 
examples of what some researchers still 
consider a “creative explosion” that began 
when modern humans colonized Europe 
about 40,000 years ago, an increasing num- 
ber of prehistorians are tracing our sym- 
bolic roots much further back in time—and 
in some cases, to species ancestral to Homo 
sapiens. Like modern humans themselves, 
symbolic behavior seems to have its origins 
in Africa. Recent excavations have turned 
up elaborate stone tools, beads, and ochre 
dating back 100,000 or more years ago, 
although researchers are still debating 
which of these finds really demonstrate 
symbolic expression. But there’s wide- 
spread agreement that the building blocks 
of symbolism preceded full-blown art. 
“When we talk about beads and art, we are 
actually talking about material technologies 
for symbolic expression that certainly post- 
date the origins of symbolic thought and 
communication, potentially by a very wide 
margin,” says archaeologist Dietrich Stout 
of University College London. 

The evolution of symbolism was once 
thought to have been as rapid as “flicking on 
a light switch,” as archaeologist Clive Gamble 
of the Royal Holloway, University of Lon- 
don, put it some years ago. But given new 
evidence that symbolic behavior appears 
long before cave paintings, 
Gamble now says that his much- 
cited comment needs to be mod- 
ified: “It’s a dimmer switch now, 
a stuttering candle.” 

As they more precisely pin- 
point when symbolic behavior 
began, scientists are hoping they 
might one day crack the tough- 
est question of all: What was its 
evolutionary advantage to 
humans? Did symbols, as many 
researchers suspect, serve as a 
social glue that helped tribes of 
early humans to survive and 
reproduce? 


Venus, phallus, or pebble? 
“T don’t know much about Art, but I know what 
I like,” quipped the humorist and art critic 
Gelett Burgess back in 1906. For archaeo- 
logists, distinguishing art from nonart is still 
quite a challenge. Take the 6-centimeter-long 
piece of quartzite known as the Venus of 
Tan-Tan. Found in Morocco in 1999 next to a 
rich trove of stone tools estimated to be 
between 300,000 and 500,000 years old, it 
resembles a human figure with stubby arms 
and legs. Robert Bednarik, an independent 
archaeologist based in Caulfield South, 
Australia, insists that an ancient human 
deliberately modified the stone to 
make it look more like a person. 
If so, this objet d’art is so old 
that it was created not by our 
own species, which first 
appears in Africa nearly 
200,000 years ago, but 
by one of our ances- 
tors, perhaps the 
large-brained H. 
heidelbergensis, 
thought by some 
anthropologists to 
be the common 
ancestor of mod- @ 
ern humans and *. 
Neandertals. That 
would mean that 
art is an extremely 
ancient part of the Homo 
repertoire. “Ignoring the 
few specimens we have 
of very early paleoart, 
explaining them away, 
or rejecting them out 
of hand does not serve this discipline well,” 
Bednarik wrote in a 2003 analysis of the 
Venus of Tan-Tan in Current Anthropology. 
Yet many archaeologists are skeptical, 
arguing that the stone’s resemblance to a 
human figure might be coincidence. Indeed, 
the debate over the Tan-Tan “figurine” is rem- 
iniscent of a similar controversy over a 
smaller stone discovered in 1981 at the site of 
Berekhat Ram in the Israeli-occupied Golan 
Heights. To some archaeologists, this 
250,000-year-old object resembles a woman, 
but others argue that it was shaped by natural 
forces, and, in any case, looks more like a 
penguin or a phallus. Even after an exhaustive 
microscopic study concluded that the 
Berekhat Ram object had indeed been etched 
with a tool to emphasize what some consider 
its “head” and “arms,” many researchers have 
rejected it as a work of art. For some, proof of 
symbolic behavior requires evidence that the 
symbols had a commonly understood mean- 
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Symmetry in stone. 
Some stone tools 
require a mental 
image to create. 
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ing and were shared within groups of people. 
For example, the hundreds of bone and stone 
“Venus figurines” found at sites across Eura- 
sia beginning about 30,000 years ago were 
skillfully carved and follow a common motif. 
They are widely regarded not only as sym- 
bolic expression, but full-fledged art. 

Thus many researchers are reluctant to 
accept rare, one-off discoveries like the Tan- 
Tan or Berekhat Ram objects as signs of 
symbolic behavior. “You can imagine [an 
ancient human] recognizing a resemblance 
but [the object] still hav[ing] no symbolic 
meaning at all,” says Philip Chase, an anthro- 
pologist at the University of Pennsylvania. 
Thomas Wynn, an anthropologist at the 
University of Colorado, Colorado Springs, 
agrees: “If it’s a one-off, I don’t think it 
counts. It’s not sending a message to anyone.” 


Tools of the imagination 
Given how difficult it is to detect 
the earliest symbolic messages in 
the archaeological record, some 
researchers look instead for proxy 
behaviors that might have 
required similar cognitive abili- 
ties, such as toolmaking. Charles 
Darwin himself saw an evolu- 
tionary parallel between tool- 
making and language, probably 
the most sophisticated form of 
symbolic behavior. “To chip a 
flint into the rudest tool,” Darwin 
wrote in The Descent of Man, 
demands a “perfect hand” as well 
adapted to that task as the “vocal 
organs” are to speaking. 

To many researchers, making 
sophisticated tools and using 
symbols both require the capac- 


ity to hold an abstract concept in one’s head— 
and, in the case of the tool, to “impose” a pre- 
determined form on raw material based on an 
abstract mental template. That kind of ability 
was probably not needed to make the earliest 
known tools, say Wynn and other researchers. 
These implements, which date back 2.6 mil- 
lion years, consist mostly of rocks that have 
been split in two and then sharpened to make 
simple chopping and scraping implements. 
Then, about 1.7 million years ago, large, 
teardrop-shaped tools called Acheulean hand 
axes appeared in Africa. Likely created by 
H. erectus and probably used to cut plants and 
butcher animals, these hand-held tools vary 
greatly in shape, and archaeologists have 
debated whether creating the earliest ones 
required an abstract mental template. But by 
about 500,000 years ago, ancient humans were 
creating more symmetrical Late Acheulean 


“If its a one-off, | don’t think 
it counts. It’s not sending a 
message to anyone.” 


—THOMAS WYNN, UNIVERSITY OF COLORADO, 
COLORADO SPRINGS 


Symbolic start. Some scientists argue that this 77,000-year-old engraved ochre 
shows symbolic capacity. 


tools, which Wynn and many others argue are 
clear examples of an imposed form based on a 
mental template. Some have even argued that 
these skillfully crafted hand axes had symbolic 
meanings, for example to display prestige or 
even attract members of the opposite sex. 

The half-million-year mark also heralded 
the arrival of H. heidelbergensis, which had 
a much larger brain than H. erectus. Not 
long afterward, our African ancestors began 
to create a wide variety of finely crafted 
blades and projectile points, which allowed 
them to exploit their environment in more 
sophisticated ways, and so presumably 
enhance their survival and reproduction. 
Archaeologists refer to these tools as Middle 
Stone Age technology and agree that they 
did require mental templates. “The tools tell 
us that the hominid world was changing,” 
says Wynn. 

As one moves forward in time, humans 
appear able to imagine and create even more 
elaborate tools, sharpening their evolution- 
ary edge in the battle for survival. By 
260,000 years ago, for example, ancient 
humans at Twin Rivers in what is now Zambia 
could envision a complex finished tool and 
put it together in steps from different compo- 
nents. They left behind finely made blades 
and other tools that had been modified— 
usually by blunting or “backing” one edge— 
to be hafted onto handles, presumably made 
of wood. These so-called backed tools have 
been widely regarded as evidence of sym- 
bolic behavior when found at much younger 
sites. “This flexibility in stone tool manufac- 
ture [indicates] symbolic capabilities,” says 
archaeologist Sarah Wurz of the Iziko Muse- 
ums of Cape Town in South Africa. 

Similar cognitive abilities were possibly 
required to make the famous 
400,000-year-old wooden spears 
from Schéningen, Germany. One 
recent study concludes that these 
spears’ creators—probably mem- 
bers of H. heidelbergensis—car- 
ried out at least eight preplanned 
steps spanning several days, 
including chopping tree branches 
with hand axes and shaping the 
spears with stone flakes. 

The idea that sophisticated 
toolmaking and symbolic 
thought require similar cognitive 
skills also gets some support 
from a surprising quarter: brain- 
imaging studies. Stout’s team ran 
positron emission tomography 
scans on three archaeologists— 
all skillful stone knappers—as 
they made pre-Acheulean and 
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Late Acheulean tools. Both methods turned 
on visual and motor areas of the brain. But 
only Late Acheulean knapping turned on cir- 
cuits also linked to language, the team 
reported last year. 


Color me red 

At Twin Rivers, it’s not just the tools that hint 
at incipient symbolic behavior. Early humans 
there also left behind at least 300 lumps of 
ochre and other pigments in a rainbow of col- 
ors: yellow, red, pink, brown, purple, and 
blue-black, some of which were gathered far 
from the site. Excavator Lawrence Barham 
of the University of Liverpool in the United 
Kingdom thinks they used the ochre to 
paint their bodies, though there’s little 
hard evidence for this. Most archaeolo- 
gists agree that body painting, as well 
as the wearing of personal orna- 
ments such as bead necklaces, 
was a key way that early humans 
symbolically communicated 
social identity such as member- 
ship in a particular group, much 
as people today declare social 
allegiances and individual person- 
alities by their clothing and jewelry. 

Yet while the Twin Rivers evidence is 
suggestive, it’s hard to be sure how the ochre 
was actually used. There’s little sign that it was 
ground into powder, as needed for decoration, 
says Ian Watts, an independent ochre expert in 
Athens. And even ground ochre could have 
had utilitarian uses, says archaeologist Lyn 
Wadley of the University of Witwatersrand 
in Johannesburg, South Africa. Modern-day 
experiments have shown that ground ochre 
can be used to tan animal hides, help stone 
tools adhere to bone or wooden handles, and 
even protect skin against mosquito bites. 

“We simply don’t know how ancient peo- 
ple used ochre 300,000 years ago,” Wadley 
says. And since at that date the ochre users 
were not modern humans but our archaic 
ancestors, some experts are leery of assign- 
ing them symbolic savvy. 

Yet many archaeologists are 
willing to grant that our species, 
H. sapiens, was creating and 
using certain kinds of symbols 
by 75,000 years ago and per- 
haps much earlier. At sites such 
as Blombos Cave on South 
Africa’s southern Cape, people 
left sophisticated tools, including elabo- 
rately crafted bone points, as well as perfo- 
rated beads made from snail shells and 
pieces of red ochre engraved with what 
appear to be abstract designs. At this single 
site, a number of what many archaeologists 


darwin. 


Online 


sciencemag.org 
Hear author 
Michael Balter 

discuss the roots of art at 

www.sciencemag.org/ 


Eye of the beholder. Archaeologists debate whether 
this modified stone was meant to represent a woman. 


consider diagnostic elements of symbolic 
behavior came together. And in work now 
in press, the Blombos team reports finding 
engraved ochre in levels dating back to 
100,000 years ago (Science, 
30 January, p. 569). 

There are other hints that the 
modern humans who ventured 
out of Africa around this time 
might also have engaged in sym- 
bolic behavior. At the Skhul rock 
shelter in Israel, humans left 
100,000-year-old shell beads 
considered by some to be personal orna- 
ments (Science, 23 June 2006, p. 1731). At 
the 92,000-year-old Qafzeh Cave site 
nearby, modern humans apparently strongly 
preferred the color red: Excavators have 
studied 71 pieces of bright red ochre associ- 


ORIGINS lL. 


ated with human burials. Some researchers 
argue that this represents an early case of 
“color symbolism,” citing the universal 
importance of red in historical cultures 
worldwide and the apparently great lengths 
to which early humans went to gather red 
ochre. There is very strong circumstantial 
evidence for the very great antiquity of the 
color red as a symbolic category,” says 
anthropologist Sally McBrearty of the 

University of Connecticut, Storrs. 

These finds of colorful ochre, fancy 
tools, and beads have convinced many 
researchers that the building blocks of 

symbolism had emerged by at least 

100,000 years ago and possibly 

much earlier. But why? What 
selective advantages did using 
symbols confer on our ancestors? 
To some scientists, the ques- 
tion is a no-brainer, especially 
when it is focused on the most 
sophisticated form of symbolic 
communication: language. The 
ability to communicate detailed, 
concrete information as well as 
abstract concepts allowed early 
humans to cooperate and plan for the 
future in ways unique to our species, 
thus enhancing their survival during 
rough times and boosting their repro- 
ductive success in good times. “What 
aspects of human social organization and 
adaptation wouldn’t benefit from the evolu- 
tion of language?” asked Terrence Deacon, a 
biological anthropologist at the University 
of California, Berkeley, in his influential 
book The Symbolic Species: The Coevolu- 
tion of Language and the Brain. Deacon 
went on to list just some of the advantages: 
organizing hunts, sharing food, teaching 
toolmaking, sharing past experiences, and 
raising children. Indeed, many researchers 
have argued that symbolic communication is 
what held groups of early humans together 
as they explored new environments and 
endured climatic shifts. 

As for art and other nonlinguistic forms 
of symbolic behavior, they may also have 
been key to cementing these bonds, by 
expressing meanings that are difficult or 
impossible to put into words. In that way, 
artistic expression, including music, may 
have helped ensure the survival of the 
fittest. This may also explain why great art 
has such emotional force, because the most 
effective symbols are those that convey 
their messages the most powerfully— 
something the artists at Chauvet Cave seem 
to have understood very well. 

—MICHAEL BALTER 
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Avoiding contamination 
of Mars 
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edited by Jennifer Sills 


Keeping Raw Data in Context 


WE AGREE WITH THE PREMISE STATED IN “MAKING CLINICAL 
data widely available” (Special Section on Clinical Trials, J. 
Kaiser, 10 October 2008, p. 217): Sharing patient-level data 
from human studies would help investigators make more and 
better discoveries more quickly and with less duplication. 
However, this happy circumstance will occur only if investiga- 
tors interpret the raw data properly within the context of the 
study that generated that data. This implies that any initiative to 
share raw clinical research data must also pay close attention to 
sharing clear and complete information about the design of the 
original studies. Relying on journal articles for study design 
information is problematic, for three reasons. First, journal articles often provide insufficient 
detail when describing key study design features such as randomization (/) and intervention 
details (2). Second, some data sets may come from studies with no publications [only 21% of 
oncology trials registered in ClinicalTrials.gov before 2004 and completed by September 2007 
were published (3)]. Finally, investigators cannot reliably search journal articles for method- 
ological concepts like “double blinding” or “interrupted time series,” crucial concepts for 
proper interpretation of the data. A mishmash of nonstandardized databases of raw results and 
unevenly reported study designs is not a strong foundation for clinical research data sharing. 
We believe that the effective sharing of clinical research data requires the establishment of an 
interoperable federated database system that includes both study design and results data. A key 
component of this system is a logical model of clinical study characteristics in which all the data ele- 
ments are standardized to controlled vocabularies and common ontologies to facilitate cross-study 
comparison and synthesis (4). As a first step toward this vision, the Human Studies Database 
Project is developing a standardized representation of study designs for all ongoing human studies 
at NIH-funded Clinical and Translational Science Award (CTSA) institutions, with the ultimate 
goal of using these standards to integrate local institutional databases of human studies for large- 
scale data sharing and reuse (5). This approach would also support legally mandated trial registra- 
tion and results reporting (6), NIH-related data sharing initiatives (7, 8), and requests for raw data 
by biomedical journals (9). To complement work on defining and standardizing study design 
descriptions, national and international policies are still urgently needed to ensure effective, ethical, 
and coordinated sharing of tightly coupled study design and raw patient-level data (/0). 
IDA SIM,?* CHRISTOPHER G. CHUTE, HAROLD LEHMANN,? RAKESH NAGARAJAN,* MEREDITH NAHM,° 
RICHARD H. SCHEUERMANN® 
1Division of General Internal Medicine, Department of Medicine, University of California, San Francisco, CA 94143-0320, 
USA. @Division of Biomedical Statistics and Informatics, Mayo Clinic, Rochester, MN 55905, USA. Division of Health 
Sciences Informatics, Johns Hopkins School of Medicine, Baltimore, MD 21205, USA. “Division of Laboratory and Genomic 
Medicine, Department of Pathology and Immunology, Washington University in St. Louis, St. Louis, MO 63110, USA. °Duke 


Translational Medicine Institute, Duke University Health System, Durham, NC 27715, USA. °Department of Pathology and 
Division of Biomedical Informatics, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA. 


Data sharing. Shared data will 
only be useful if complete infor- 
mation about the original study 
design is also available. 
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The IRB Is Key 


IN HIS RESPONSE TO A LETTER ON HIS PER- 
spective “Homo experimentalis evolves” (1) 
(31 October 2008, p. 672), J. A. List replies 
that the economic research he conducted did 
not require informed consent by the unwitting 
participants because the study yielded inter- 
esting results and did no harm. The response 
concludes, “Those cases in which there are 
minimal benefits of informed consent but 
large costs are prime candidates for relaxation 
of informed consent.” 

There are certainly grounds for dispensing 
with informed consent in social sciences 
research, and Institutional Review Boards 
(IRBs) have guidelines for doing so. However, 
there are no general rules that cover every 
instance, and the investigator does not have the 
authority to make the decision about when the 
guidelines apply. Instead, exemptions are 
granted by IRBs only on a case-by-case basis. 
In this case, the appropriate question is, did List 
submit his research protocol to his supervising 
IRB and request an exemption? The decision to 
dispense with informed consent is not the 
author’s. 

WILLIAM R. LOVALLO 
Department of Psychiatry and Behavioral Sciences, 
University of Oklahoma Health Sciences Center, VA Medical 


Center, Oklahoma City, OK 73104, USA. E-mail: bill@ 
mindbody1.org 


Reference 
1. J.A. List, Science 321, 207 (2008). 


Response 

AS MY LETTER SUGGESTS, INSTITUTIONAL REVIEW 
Boards (IRBs) are a necessary condition 
and serve an invaluable role; I wrote, “Local 
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Research Ethics Committees and Institutional 
Review Boards in the United States serve an 
important role in monitoring such activi- 
ties.” All of my research has IRB approval, 
and I suspect from W. R. Lovallo’s concerns 
that he and IJ agree on all aspects of subject 
approval. In my own research, I have been 
even more stringent than IRB require- 
ments—I do not deceive subjects and always 
ensure that they are better off due to my 
experiment. These conditions are certainly 
not a constraint in all, or even many, IRBs 
that I am aware of. 

On another note, some researchers do not 
have Local Research Ethics Committees and 
Institutional Review Boards. Outside the 
United States, researchers in the social sci- 
ences must rely largely on their own moral 
principles. I view my letter as also speaking to 
these scholars and letting them know that strict 
standards need to be followed. 

JOHN A. LIST 
Department of Economics, University of Chicago, and 


National Bureau of Economic Research, Chicago, IL 60637, 
USA. E-mail: jlist@uchicago.edu 


Implications of Ancient Ice 


IN THEIR BREVIA “ANCIENT PERMAFROST AND 
a future, warmer Arctic” (19 September 2008, 
p. 1648), D. G. Froese eral. reported the discov- 
ery of an ice wedge more than 740,000 years 
old within the discontinuous permafrost zone 
of central Yukon Territory, Canada. Permafrost 
in this area is warm (with an average tempera- 
ture greater than —2°C) and strongly controlled 
by local site conditions. It is generally sparse or 
absent on south-facing slopes and in areas lack- 
ing insulating vegetation cover. 

Permafrost is very sensitive to climate 
change: If the average annual air temperature 
changes by several degrees, then the temper- 
ature of soil and permafrost soon will change 
by the same amount (/—3). Therefore, a 2°C 
warming there would lead to permafrost 
degradation in the Yukon Territory. By as- 
suming that temperatures during at least two 
previous interglacials were substantially 
warmer than those today, Froese et al. con- 
clude that the persistence of that ice wedge 
implies that permafrost can be more stable 
than previously thought. 

Their conclusion may not be valid, for two 
reasons. First, Froese et al. provide no evi- 
dence that average annual temperatures in the 
Yukon Territory were higher during previous 
interglacials than they are today. Second, there 
is no physically substantiated explanation of 
how permafrost could be stable across such 
climate warming. Permafrost stability during 
warmer interglacials would seem to require 


CORRECTIONS AND CLARIFICATIONS 


Letters: “Unsung hero Robert C. Gallo” by G. Abbadessa et al. (9 January, p. 206). Author Riccardo Dalla-Favera’s name 
was spelled incorrectly. The misspelling has been corrected online. 


Random Samples: “Getting there” (2 January, p. 19). The credit should have been Andrew Nelson/oint Research Center 
of the European Commission. 


News Focus: “Shortfalls in electron production dim hopes for MEG solar cells” by R. F. Service (19 December 2008, p. 
1784) contained several inaccuracies. A 2004 paper by Victor Klimov and colleagues reported multiple exciton generation 
(MEG) in lead selenide nanocrystals, not lead sulfide as reported. The maximum efficiency at the time was 220% for a pho- 
ton energy equal to 3.8 times the nanocrystals’ band gap. This was later increased to 268% at 4.3 times the band gap. (The 
700% efficiency reported was produced using photons with 7.8 times the nanocrystal band gap.) More recent results for 
stirred solutions saw the efficiency decline not to 40% as stated, but to 150% for photons 4.3 times the bandgap, only 
slightly higher than the 123% seen by Moungi Bawendi’s group at MIT for the same ratio of the photon energy to energy 
gap. The difference in the MEG effect between different nanocrystal samples in the case of stirred solutions was 30%, not 
300% as reported. 


The sentence “Synthetic differences between samples may have left some with surfaces that enhance the effect” should 
have stated simply that the surface variation between samples likely accounts for some of the variation of the effect. The 
Los Alamos team stirred the samples to keep the nanoparticles from absorbing more than one photon from sequential laser 
pulses, not “more than one photon at a time.” Klimov does not consider the new results “bad news,” but agrees that the 
effect is too small to make a practical difference for current solar cells. The MEG effort at the National Renewable Energy 
Laboratory is led by Arthur Nozik; the group treated solar cells made with nanocrystal films with ethanedithiol, not 
hydrazine. 


An E-Letter with further clarifications has been published online at www.sciencemag.org/cgji/eletters/322/5909/1784a. 


TECHNICAL COMMENT ABSTRACTS 


Comment oN “Human-Specific Gain of Function in a Developmental 
Enhancer” 


Laurent Duret and Nicolas Galtier 


Prabhakar et al. (Reports, 5 September 2008, p. 1346) argued that the conserved noncoding sequence HACNS1 
has undergone positive selection and contributed to human adaptation. However, the pattern of substitution in 
HACNS1 is more consistent with the neutral process of biased gene conversion (BGC). The reported human-specific 
gain of function is likely due to the accumulation of deleterious mutations driven by BGC, not positive selection. 


Full text at www.sciencemag.org/cgi/content/full/323/5915/714c 


RESPONSE TO COMMENT ON “Human-Specific Gain of Function ina 
Developmental Enhancer” 


Shyam Prabhakar, Axel Visel, Jennifer A. Akiyama, Malak Shoukry, Keith D. Lewis, Amy Holt, 
Ingrid Plajzer-Frick, Harris Morrison, David R. FitzPatrick, Veena Afzal, Len A. Pennacchio, 
Edward M. Rubin, James P. Noonan 


Duret and Galtier argue that human-specific sequence divergence and gain of function in the HACNS1 enhancer 
result from deleterious biased gene conversion (BGC) with no contribution from positive selection. We reinforce our 
previous conclusion by analyzing hypothesized BGC events genomewide and assessing the effect of recombination 
rates on human-accelerated conserved noncoding sequence ascertainment. We also provide evidence that AT > GC 
substitution bias can coexist with positive selection. 


Full text at www.sciencemag.org/cgi/content/full/323/5915/714d 
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that the permafrost was covered with a mas- 
sive layer of organic insulator, such as moss or 
peat, in order to prevent summer heating of 
soils. Such a condition seems to be contra- 2 
dicted by the facts that such a cover does not 
exist there today and that there is no evidence 
that widespread fires might have burned any 
such cover had it once existed. Consequently, 
a simpler explanation of why the ice wedge 
did not melt during the past 740,000 years is 
that this territory was never substantially 
warmer then it is today. SERGEY ZIMOV 


Response 

WE THANK S. ZIMOV FOR HIS INTEREST IN 
our Brevia. Zimov does not agree that the per- 
sistence of ice wedges in Yukon Territory for 
more than 700,000 years reflects thermal 
resilience of permafrost more than a few 
meters below the surface. He instead inter- 
prets our finding as evidence that tempera- 
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tures during previous interglacials were no 
higher in the Yukon than those today. 

Zimov states that the temperature of per- 
mafrost will increase soon after surface air 
temperature increases. However, unlike the 
elements of the cryosphere that lack insulating 
surface cover, such as sea ice or glaciers, per- 
mafrost is buffered from air temperatures by 
soil properties, snow cover, vegetation, and 
ground ice. For example, peat, a common 
ground cover in discontinuous permafrost, 
promotes rapid winter cooling and slows sum- 
mer warming, while ice lenses at the top of per- 
mafrost retard thaw due to latent heat effects. 
Field evidence for the resilience of permafrost 


Letters to the Editor 


Letters (~300 words) discuss material published 
in Science in the previous 3 months or issues of 

general interest. They can be submitted through 
the Web (www.submit2science.org) or by regular 


mail (1200 New York Ave., NW, Washington, DC 
20005, USA). Letters are not acknowledged upon 
receipt, nor are authors generally consulted before 
publication. Whether published in full or in part, 
letters are subject to editing for clarity and space. 


is demonstrated from the early Holocene 
warm interval at a site in the discontinuous 
permafrost zone of central Yukon, where depth 
of seasonal thaw (active layer) was only 70 cm 
greater than today (/), despite several centuries 
of warmer-than-present temperatures (2). 

Zimov presents an alternative interpreta- 
tion for the preservation of ancient ice, sug- 
gesting that middle and late Pleistocene 
interglaciations in our study region were 
not substantially warmer than present. 
However, northward expansion of latitudi- 
nal tree-line (3), fossil plant and insect 
assemblages (4), and stratigraphic evidence 
for widespread degradation of shallow per- 
mafrost (5) indicate that temperatures were 
warmer by up to several degrees during the 
last interglaciation. 

In light of the physical properties of per- 
mafrost, proxy paleoenvironmental indicators 
from the last interglacial, and other evidence 
in our Report, we maintain that the persistence 
of 700,000-year-old ice wedges in the discon- 
tinuous permafrost zone reflects the thermal 
resilience of deep permafrost. We note, how- 
ever, that our observations of relict ground ice 
in Yukon Territory are consistent with the thaw 
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of shallow permafrost to a depth of a few 
meters, which we think occurred during the 
last interglaciation (6). Higher air tempera- 
tures certainly will warm and eventually thaw 
shallow permafrost and associated carbon 
reservoirs, but permafrost more than a few 
meters below the surface is likely stable 
through the 21st century. 
DUANE G. FROESE,?* ALBERTO V. REYES,* 
JOHN A. WESTGATE,? SHARI J. PREECE? 
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Comment on “Human-Specific 
Gain of Function in a 
Developmental Enhancer" 


Laurent Duret?* and Nicolas Galtier? 


Prabhakar et al. (Reports, 5 September 2008, p. 1346) argued that the conserved 
noncoding sequence HACNS1 has undergone positive selection and contributed to human 
adaptation. However, the pattern of substitution in HACNS1 is more consistent with the 
neutral process of biased gene conversion (BGC). The reported human-specific gain of 
function is likely due to the accumulation of deleterious mutations driven by BGC, not 


positive selection. 


he characterization of functional non- 
| coding regulatory elements positively 
selected during human evolution is of ma- 
jor importance for understanding the genetic 
basis of human-specific adaptations. One pos- 
sible approach to identify such elements en- 
tails searching for genomic regions that are highly 
conserved across nonhuman vertebrates but 
strongly divergent, that is, rapidly evolving, in 
humans (/, 2). These regions, called HACNSs 
(human-accelerated conserved noncoding se- 
quences) or HARs (human-accelerated regions), 
are good candidates for being regulatory ele- 
ments under positive selection. 

Prabhakar et al. (3) reported the detailed anal- 
ysis of the first of these candidates, the 546—base 
pair (bp) long HACNS1. HACNS/ has accumu- 
lated 16 human-specific changes since the human/ 
chimpanzee divergence, which represents a sub- 
stitution rate four times as high as would be ex- 
pected given the average neutral substitution rate 
in the human genome. Thirteen of the 16 changes 
are clustered in a small region (81 bp) of HACNS/. 
Using a mouse model, the authors demonstrated 
that human HACNSI/ could drive the expression 
of a reporter gene in the mesenchyme of the 
early-developing forelimb and later-developing 
hindlimb in embryos. This pattern of expression 
was very different from the one observed when 
the chimpanzee or macaque HACNS/ sequences 
were assayed. Prabhakar et a/. further showed 
through directed mutagenesis that the 13 changes 
in the 81-bp region are responsible for the differ- 
ence in enhancer activity between human HACNS/ 
and its orthologs in apes. Accelerated sequence 
evolution is a hallmark of positive selection. The 
authors therefore concluded that these 13 sub- 
stitutions have been driven by positive selection. 
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They suggest that these changes may have con- 
tributed to the evolution of human-specific digit 
and limb patterning (3). 

Positive selection, however, is not the only 
possible explanation for accelerated sequence 
evolution. Biased gene conversion (BGC) is a 
neutral process associated with meiotic recom- 
bination, which favors the fixation of AT — GC 
mutations (4). Given that recombination often 
occurs in hotspots (<< 2 kb), BGC can create 
strong substitution hotspots, thereby leading to 
spurious signatures of positive selection (5—7). 
BGC was identified through its effect on neutral 
sites (6-9), but it can also drive the fixation of 
weakly deleterious mutations in functional ele- 
ments (5, /0). Noteworthy features of BGC are 
that its prevalence is particularly high in sub- 
telomeric regions (7) and that it is much more 
strongly associated with the rate of crossovers 
in male than in female germlines (6, 7, 9). We 
have previously shown that among the HARs 
that were proposed as candidates for positive se- 
lection, many show the hallmarks of BGC: There 
is an excess of HARs in regions of high recom- 
bination rate and the pattern of substitution in 
HARs is strongly biased toward GC (5). This 
is precisely the pattern observed in HACNS1. 
First, this element is located in a subtelomeric 
region of chromosome 2, where the rate of male 
crossover is particularly elevated [2.77 cM/Mb, 
compared with 0.98 cM/Mb on average for auto- 
somes; regions with a male crossover rate higher 
than 2.77 cM/Mb represent only 7% of the ge- 
nome (//)]. Second, among the 16 substitutions 
in HACNSI there are 14 AT — GC substitu- 
tions, 2 GC — CG substitutions, but not a sin- 
gle GC — AT substitution. Functional elements 
(coding or noncoding) in mammalian genome 
are not particularly GC-rich, so there is a priori 
no reason why selection should systematically 
favor AT — GC over GC — AT mutations. Con- 
versely, this pattern of substitution is exactly 
the one expected under the BGC model. 

Prabhakar et al. (3) reject neutral hypothe- 
ses for three reasons: (i) the substitution rate 


in HACNS1 is four times the local neutral rate; 
(ii) the authors claim that under the neutral 
BGC hypothesis “one would expect an increase 
in the overall substitution rate across the entire 
region of increased AT — GC substitution” but 
that the human-specific substitution rate is only 
elevated in a narrow 81-bp region of HACNS/ 
and is close to the local average outside this 
region; and (iii) their experiments demonstrate 
that these human-specific substitutions have a 
substantial functional impact. 

The first point is not an argument against the 
BGC model: As with selection, strong BGC epi- 
sodes can result in substitution hotspots (5-7, /2). 
The second point is a misinterpretation of the 
cited article by Galtier and Duret (5). In this 
paper, we indicate that a BGC-driven substi- 
tution hotspot must be GC-biased. However, the 
reverse proposal is not true: Weak BGC can lead 
to an excess of AT — GC substitutions without 
strongly affecting the substitution rate, as shown 
theoretically and empirically (7). Recombination 
hotspots vary in strength, are evolutionarily un- 
stable, and tend to move rapidly (/3). The pat- 
tern presented by Prabhakar e¢ al. [figure 4 in 
(3)] suggests that the evolution of the 81-bp 
segment has been driven by a strong episode of 
BGC due to an intense and/or long-lived recom- 
bination hotspot, in a region otherwise affected 
by weaker recombination hotspots. This is in 
agreement with current knowledge about the 
spatiotemporal distribution of recombination in 
humans (/4). 

Finally, the third point is the most important 
argument raised by the authors: The strong func- 
tional changes associated with the human-specific 
substitutions in HACNS1 imply that its rapid 
evolution was driven by adaptation. This logical 
implication, however, does not necessarily hold; 
not only advantageous substitutions have a pheno- 
typic effect. We know that BGC can overcome 
natural selection and drive the fixation of weak- 
ly deleterious AT — GC mutation (5, /0). The 
fact that HACNS/ is under very strong purifying 
selective pressure in nonhuman vertebrates in- 
dicates that, in most species, mutations in this 
element have some deleterious effect. Hence, 
the most likely interpretation of the observed 
substitution pattern is that the human-specific 
changes correspond to weakly deleterious mu- 
tations, driven to fixation by BGC. In other words, 
the strong functional shift in human HACNS/ 
enhancer activity probably results from the ac- 
cumulation of numerous weakly deleterious 
substitutions. 

Tn conclusion, we contend that the substitution 
pattern in HACNS1 does not support the hypoth- 
esis of positive selection. Although we cannot for- 
mally exclude that HACNS/ somehow contributed 
to human adaptation, the most parsimonious 
interpretation is that the evolution of HACNS/ 
merely reflects the maladaptive consequences 
of recombination hotspots—the Achilles’ heel of 
our genome. 
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Duret and Galtier argue that human-specific sequence divergence and gain of function in the 
HACNS1 enhancer result from deleterious biased gene conversion (BGC) with no contribution 
from positive selection. We reinforce our previous conclusion by analyzing hypothesized BGC 
events genomewide and assessing the effect of recombination rates on human-accelerated 
conserved noncoding sequence ascertainment. We also provide evidence that AT > GC substitution 


bias can coexist with positive selection. 


uret and Galtier (7) suggest that biased 
D gene conversion (BGC) alone drove 

the rapid human-specific evolution of 
human-accelerated conserved noncoding se- 
quence | (HACNS1) (2, 3) and that positive se- 
lection played no role. BGC is a nonadaptive, 
recombination-driven mechanism hypothesized 
to increase the rate of AT — GC substitutions in 
a given interval (4, 5). In support, it has been 
shown that the AT — GC substitution rate is 
more strongly correlated with male recombina- 
tion rate than with GC content (6). Clusters [100 
to 1000 base pairs (bp)] of human-specific sub- 
stitutions occurring in regions of elevated male 
recombination rate show an AT — GC bias, 
which suggests that BGC may have contributed 
to some substitution hotspots in the genome (5). 
These correlations are stronger at the 1 Mb scale 
than at finer scales, likely because fine-scale re- 
combination rates are less stable over time than 
coarse-scale rates (5, 7, 8). In addition, human- 
accelerated regions (HARs) identified in a whole- 
genome screen of conserved genomic elements 
have an excess of AT — GC substitutions and are 
enriched in broad regions of high average recom- 
bination (4, 9, /0). 

In light of these findings, Duret and Galtier's 
observation that some HARs and HACNSs could 
be due solely to BGC is reasonable (4). However, 
their present attempt to extrapolate this argument 
to HACNS/ requires several unsupported assump- 
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tions regarding the ascertainment of HACNSs 
compared with HARs and the known distribution 
of recombination hotspots in the genome. We 
examine the authors’ claims below. 

The robust gain of limb expression in HACNS/ 
is not in itself evidence of adaptation. However, 
the most likely effect of deleterious substitution 
in the enhancer would be to randomly distort the 
existing function rather than to strengthen the an- 
cestral expression pattern while introducing a new, 
robust expression domain. The precise, highly 
spatially restricted gain of function in HACNS/ 
is exactly what one would expect from an adapt- 
ive process that altered a specific functional mod- 
ule in the enhancer, as suggested by the pattern 
of substitutions we observed, while leaving the an- 
cestral expression domains intact. Therefore, we 
find the adaptive hypothesis qualitatively more 
parsimonious than the authors’ alternative (/). 

Overrepresentation of HARs in subtelomeric 
regions, which display high recombination rates, 
has been cited as evidence that some HARs are 
due to BGC rather than to positive selection (4, 10). 
However, in assuming that HACNSs suffer from 
a similar systematic bias, Duret and Galtier mis- 
understand how HACNSs were identified. In con- 
trast to HARs, which were ascertained relative 
to whole-genome averaged substitution rates, 
HACNSs were defined based on their acceler- 
ation relative to the local neutral substitution 
rate (2, 9). Thus, the method used to identify 
HACNSs mitigates the local average effect of 


Table 1. Lack of HACNS enrichment in regions 


recombination-driven BGC on substitution rates. 
Although some HACNSs may be due to BGC, a 
comparison of HACNS distribution relative to 
male-specific recombination rates demonstrates 
that HACNSs as a class show no enrichment in 
regions of high recombination (Table 1) (//). 
Moreover, the fraction of conserved noncoding 
sequences within a 10-Mb window centered on 
HACNS! that are HACNSs (0.8%) is lower than 
the whole-genome average (0.9%). The elevated 
male-specific recombination rate around HACNS/ 
thus has no bearing on whether HACNS1 is af- 
fected by BGC. 

Because HACNSs show no correlation with 
coarse-scale recombination rates, Duret and Galtier's 
claim that BGC explains human-specific accel- 
eration in HACNS/ rests on a single point: the 
excess of AT — GC substitutions in the enhancer. 
As we discussed in (3), this AT — GC excess is 
not confined to the 546-bp HACNS/ element; in- 
stead, it extends into the nonconserved flanking 
regions, covering ~5 kb in all (3). However, the 
overall human-specific substitution rate is close 
to the local (1 Mb) average rate across nearly 
this entire region, with one critical exception: the 
sharp spike of 13 substitutions in a highly con- 
strained 81-bp subregion within HACNS/. 

Duret and Galtier (7) propose a nonparsimo- 
nious “BGC only” scenario for the spatial pro- 
file of substitutions in and around HACNSI: a 
sharp (~80 bp), extremely strong recombination 
hotspot embedded in a larger (~5 kb) weak hot- 
spot. In support, they claim that such a spatial 
profile is “in agreement with current knowledge 
about the spatiotemporal distribution of recom- 
bination in humans” and reference Myers et al. 
(12). However, this is a misinterpretation, because 
that study analyzed recombination rates at a low 
resolution of ~2 kb and therefore cannot provide 
insight on the likelihood of ~80-bp hotspots within 
larger “warmspots.” Moreover, the authors’ pro- 
posed BGC-driven cluster of 13 substitutions in 
81 bp is implausibly narrow and dense, even when 
compared with the 200 most extreme putative 
BGC-driven substitution hotspots in the genome, 
which average ~370 bp and have an average sub- 
stitution density equivalent to <5 substitutions in 
81 bp (5). 

We reject the authors’ assumption that BGC 
and adaptation are mutually exclusive. The high- 
ly localized spike in the substitution rate in HACNS/ 
is more consistent with positive selection than 
with BGC. However, the AT — GC substitution 


of high male-specific recombination (21). 


Fisher 
All CNSs HACNSs twscided B 
Total in genome 110,549 992 
No. in regions with 13,542 135 0.25 
male recomb >2.0 cM (12.3%) (13.6%) . 
No. in regions with 9,330 87 0.73 
male recomb >2.5 cM (8.44%) (8.77%) , 
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bias in the rapidly evolving segment of HACNS/ 
and in the ~5-kb flanking sequence where the 
overall substitution rate is not elevated suggests 
that BGC may have influenced the evolution of 
this enhancer. The substitution pattern in HACNS/ 
may thus be due to positive selection synergiz- 
ing with a moderate BGC domain. A recombi- 
nation hotspot of mild strength, that extended 
over ~5 kb and elevated the fraction of AT + GC 
substitutions without increasing the overall sub- 
stitution rate, would explain the observed substi- 
tution pattern in the neutral sequence surrounding 
HACNS1. Within HACNS1, positive selection for 
new GC-rich transcription factor binding sites or 
loss of AT-rich sites likely amplified a modest 
AT — GC fixation bias caused by BGC, produc- 
ing a sharp increase in the substitution rate with- 
in an ~80 bp cluster of binding sites that generated 
the human-specific gain of limb expression. 
Evidence that an extreme AT — GC substitu- 
tion bias can coexist with positive selection comes 
from a 382-bp putative BGC event (chr18:897,475- 
897,857 in hg18) identified by Dreszer et al. (5), 
which contains 19 substitutions and overlaps 
exon 2 of ADCYAP1. In parallel to HACNS1/, the 


human-specific substitutions comprising this BGC 
event are significantly clustered in functional se- 
quence: The 132-bp exon contains 13/19 substi- 
tutions (Fisher’s exact test, P = 0.002), despite 
being conserved across terrestrial vertebrate ge- 
nomes. All 13 substitutions are AT — GC and 
produce 11 amino acid changes. The human- 
specific nonsynonymous substitution rate in the 
exon is well above the synonymous rate (dN/dS = 
3.2), a classic signature of adaptive evolution (/3). 
It is difficult to imagine how neutral or deleteri- 
ous BGC could selectively favor nonsynonymous 
substitutions. We hypothesize that accelerated 
evolution in ADCYAP/ and HACNS/ are due to a 
common mechanism: synergy between BGC and 
positive selection producing a cluster of AT — GC 
substitutions at functional sites. 

In summary, we find no basis for the authors’ 
claim that the gain of function in HACNSI/ is 
solely due to BGC. Our analysis reinforces our 
initial conclusion, that positive selection likely 
played a role in HACNS/ evolution. 
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EVOLUTION 


The Overwhelming Evidence 


Massimo Pigliucci 


t had to be done, and Jerry Coyne is 
unquestionably one of the most qualified 
people for the job. I am referring to a clear, 
engaging, accessible explanation of the evi- 
dence for evolution, an aspect of the so-called 
evolution-creation “controversy” that is too 
often neglected. There are, of course, plenty of 
books criticizing creationism and its cousin 
intelligent design (/—2) as well as works aim- 
ing to explain the creationist phenomenon 
within the broader context of American anti- 
intellectualism (3). We can also easily find 
plenty of superb books for the public about 
various aspects of evolutionary biology [e.g., 
(4-6)] even beyond the classical essays by 
Stephen Gould and Richard 
Dawkins. And yet, it is hard to 
get one’s hands on a good 
non-—college-level presenta- 
tion of why evolution is, as 
they say, both a theory and a 
fact. Coyne’s Why Evolution 
Is True begins to fill this obvi- 
ous lacuna, even though— 
just like in other branches of 
science—additional popular 
writing by scientists and well- 
informed journalists on evo- 
lution will be welcome for 
many years to come. \ 
The first eight chapters \ 
span pretty much everything be 
one may want to know about 
evolution but, apparently, so 
few dare to explain. Coyne | 
(an evolutionary geneticist at \ 


; we 
\ 


\mn 


the University of Chicago) hi ie 
first introduces readers to a “ye 


basic definition of evolution. 

With an expert hand, he then leads them 
through discussions of the paleontological 
evidence, developmental biology and vestig- 
ial organs (and molecules), biogeography, nat- 
ural selection (“the engine of evolution”), sex- 
ual selection, species concepts, and speciation 
processes. The latter two entries should actu- 
ally be read in the singular, because Coyne 
subscribes to one species concept (based on 
reproductive isolation) and one chief mode of 
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speciation, allopatry. He 
and I disagree on this and 
on other aspects of cur- 
rent evolutionary theory, 
but this is not the place to entertain technical 
arguments at the cutting edge of the field. 
Still, readers of Coyne’s book will get a fairly 
conservative version of evolutionary theory, 
with occasional hints about the many heated 
discussions that characterize any live science 
and that eventually fuel its progress toward a 
better understanding of the natural world. 
Perhaps the best part of the book is the one 
that addresses the real issue: human evolution. 
It has been noted many times that if evolution- 


Human, Australopithecus, and chimpanzee. 


ary biologists limited themselves to the non- 
human world, very few people would pound 
their fists on the pulpits of fundamentalist 
churches to decry “the evil doctrine.” But 
when science strikes at our own cherished 
self-image, then the trouble begins. Just ask 
Copernicus and Galileo. Coyne is clear and 
convincing here, at least for those sufficiently 
open-minded to read the book seriously. He 
makes a good use of narrative and images, and 
one of the latter truly is worth a thousand or 
more words: Early on in the chapter on human 
evolution, Coyne simply puts side by side 
the complete skeletons of a modern human, 
an Australopithecus afarensis, and a chim- 


panzee. The message is not, obvi- 
ously, that A. afarensis is a “missing 
link” between us and the chimps, 
because the latter two species are 
contemporary and modern whereas 
A. afarensis is ancient. But it takes a 
particularly obtuse mind to look at 
the figure and reject the notions that 
A. afarensis is a member of the 
human lineage and that we and 
chimps have evolved from a com- 
mon ancestor. Then again, there is no dearth of 
obtuse minds when it comes to creationism. 

The problem with the creation-evolution 
issue, however, is that it is not about the evi- 
dence. The clash is not a scientific debate, it is 
a social controversy. Coyne understands this, 
and he begins his last chapter by recounting 
the story ofa public lecture he gave about evo- 
lution and intelligent design. Afterward he 
was approached by someone in the audience 
who frankly stated: “I found your evidence for 
evolution very convincing—but I still don’t 
believe it.” What is a scientist to do? Coyne 
admits that the issue goes far beyond science, 
into philosophy and questions of meaning and 
morality. Which is why philosophers have 
been very helpful in this arena during the past 
several years. It is a matter of explaining to the 
public not just the power but the limits of sci- 
ence. Coyne is critical, for instance, of much 
evolutionary psychology and the facile just-so 
stories that have abounded of late to “explain” 
all sorts of human behaviors, from rape to 
depression. I’m with him on this. The point is 
not that aspects of human behavior did not 
evolve by natural selection, but rather that the 
usually high standards of behavioral genetics 
are simply not met by most, though not all, the 
evolutionary psychology literature. 

Still, creationists have had a problem with 
evolution ever since Darwin and will continue 
to challenge its teaching in public schools out 
of the same paranoia about moral decay that 
gripped Victorian discussions of The Origin of 
Species. What we need is a cultural change, 
which notoriously takes a long time and is not 
just a matter of presenting the evidence for the 
rational position and walking away while pat- 
ting oneself on the back for a job well done. 
Nonetheless, we must present the evidence, and 
Jerry Coyne’s book does an excellent job of it. 
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EVOLUTION 


On the Metazoan Tree 


Wallace Arthur 


nlike many books, Perspectives in 
| | Animal Phylogeny and Evolution has 
a remarkably accurate title. Ales- 
sandro Minelli (an evolutionary biologist at 
the University of Padova, Italy) presents per- 
spectives rather than “facts,” because our 
view of the structure of the animal kingdom 
is still changing, and he dis- 
cusses both that structure 
(phylogeny) and the mecha- 
nisms of evolution through 
which it has come about. 

The book deals separately 
with evolutionary pattern and 
process, in that order. The very 
different flavors of the two sets 
of chapters suggest a parallel 
between the organization of 
the book and the organization 
of animals that Minelli calls 
“the double animal.” That refers 
to the quasi-independence of parts derived 
from ectoderm or mesoderm and from endo- 
derm (e.g., the many cases where the ectoder- 
mal or mesodermal structures are segmented 
while endodermal structures, such as the 
gut, are not). 

Two commendably brief and effective 
introductory chapters indicate how Minelli 
sees the relation between evolutionary pattern 
and process. One theme that emerges here is 
that the reasons for the evolutionary origin of a 
structure can rarely be found in the functions 
of its more elaborate, much later form. For ex- 
ample, feathers did not begin to evolve because 
of selection for the ability to fly. To put it 
another way (one that would have appealed to 
Stephen Jay Gould), exaptation is everywhere. 

Minelli makes it clear that he sees phy- 
logeny as a backcloth against which evolu- 
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tionary processes can be better 
understood. This “pattern be- 
fore process” argument is fa- 
miliar to us from cladistics. But 
Minelli subsequently goes much 
further than many cladists into 
evolutionary processes, and I 
confess that I find this side of 
things (both in the book and in 
general) the more interesting. 

That’s not to say that there is 
nothing of interest in the phylo- 
genetic chapters. At their outset, 
Minelli reminds us that there is 
little evidence in the fossil 
record for animal life before 
about 550 million years ago 
(Ma) and nothing at all before 
the base of the Vendian period 
(about 650 Ma). This reminder 
is timely, as it is becoming less and less likely 
that the 1990s molecular clock estimates of 
billion-year-old divergences between major 
animal groups (such as protostomes and 
deuterostomes) are correct. 

With regard to the branch- 
ing pattern of basal metazoans, 
Minelli discusses various pos- 
sibilities, including the recent, 
unexpected placement of comb 
jellies (Ctenophora)—rather 
than the sponges (or some of 
them)—as the sister group to 
all other animals. I am skepti- 
cal of this suggestion, and, as 
Minelli emphasizes, the jury is 
still out on it. Nonetheless, the 
surprising frequency with 
which such new possibilities appear in the lit- 
erature reminds us of how important a cau- 
tious, perspectives approach is. 

As Minelli pushes on toward what we 
might want to call (although he doesn’t) more 
advanced animals, the book is readable 
enough to keep right on going. However, I sus- 
pect that most readers will instead use these 
chapters, especially “A gallery of the major 
bilaterian clades,” for reference. In any event, 
these chapters lay the foundation for what fol- 
lows, the three chapters on processes. 

These offer lots of interesting ideas, all 
eloquently explained. I don’t accept all of 
them, but I don’t imagine that Minelli would 
expect me to. There are also instances where I 
agree with his main point but not with his 
underlying reason. For example, he states that 
“we still have no satisfactory and comprehen- 
sive theory of development,” a situation he 
largely attributes to “the near-universal adul- 
tocentric attitude adopted until now.” I accept 
the second statement—many biologists 


On the basal Bilateria branch? The 
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regard developmental 
stages as functioning 
simply “to produce an 
adult” rather than as 
organisms having (es- 
pecially in the case of 
larvae) their own lives. 
But I believe the main 
reason we lack a com- 
prehensive theory of 
development is that we 
don’t understand the 
quantitative dynamics 
of gene-protein interac- 
tions; we are still largely 
restricted to a more quali- 
tative view in which in- 
teractions are described 
in a binary way, as acti- 
vations or inhibitions. 

Similarly, I like very much Minelli’s con- 
cept of paramorphism, perhaps best explained 
in relation to limbs. As he points out, with very 
few exceptions segmented animals have seg- 
mented limbs and unsegmented animals have 
unsegmented limbs. That suggests the limbs 
are “axis paramorphs”: the genetic machinery 
used to make the trunk has been reused to pro- 
duce them. Minelli sees paramorphism as an 
alternative to gene cooption, whereas I would 
see it as a version of that. In any event, it is an 
important concept. 

Minelli ends with five “guidelines for 
future investigations into animal evolution.” 
One is to abandon the adultocentric view 
and recognize that “the fundamental unit of 
evolution is represented by the life cycle as 
an integrated whole,” with each stage sub- 
ject to mutationally introduced and selec- 
tively driven changes. This position has 
already been accepted by many, and it can be 
seen as a way to unite evo-devo and popula- 
tion genetics. 

The book’s one-to-five ratio of references 
to text suggests a wide-ranging scholarship 
presented in a style of writing sometimes 
described as “spare”—the opposite of ver- 
bose. And indeed that is exactly what Minelli 
offers. Whether or not one agrees with his 
various perspectives on phylogeny and evo- 
lutionary mechanisms, the book is easily rec- 
ognized for what it is: a magnificent tour of 
metazoan relationships, characterized by a 
cautious, nondogmatic approach to both pat- 
tern and process. Minelli knows how much 
we don’t yet know for sure, and he suggests 
interesting and potentially productive ways 
forward. This concise account easily can and 
should be read by all serious students of ani- 
mal phylogeny and evolution. 
10.1126/science.1170743 
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PLANETARY SCIENCE 


Biologically Reversible Exploration 


Christopher P. McKay 


he international Committee on Space 
| Research (COSPAR) has established 
a “planetary protection” policy that 
involves not contaminating other worlds in a 
way that would jeopardize the conduct of 
future scientific investigations. As a signatory 
to the 1967 Outer Space Treaty, the United 
States is required by article IX to avoid “harm- 
ful contamination” of the other worlds of the 
Solar System. However, further revisions to 
the policy are needed. 
The two Viking landers that arrived on 
Mars in 1976 were heat-sterilized to comply 
with planetary protection. After Viking, 


COSPAR determined that given the inhos- 
pitable conditions on Mars, sterilization was 
no longer required unless the spacecraft 
specifically intended to search for life as one 
of its scientific goals. Thus, the Pathfinder 
spacecraft and the two Mars Exploration 
Rovers (Spirit and Opportunity) arrived at 
Mars with an estimated bioburden exceeding 
many hundreds of thousands of bacteria (/) 
(see photo). 

In 2003, COSPAR revised its planetary 
protection policy for Mars from a probabilis- 
tic approach to one that was based on the mis- 
sion objectives and target (2). It includes the 
notion of “special regions,” regions in which 
biocontamination from Earth might grow and 
thus require special protection. Any spacecraft 
components that enter such a special region 
have to be heat-sterilized. The subsurface ice 
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The heat shield from the Mars Exploration Rover Opportunity photographed 
in February 2005. Debris on Mars could shield microorganisms from UV light. 


that was the target for the arm on the Phoenix 
lander is a special region, and the arm was 
therefore heat-sterilized and placed within a 
biobarrier bag before launch to Mars in 2007. 
Pieces of the rest of the Phoenix lander, which 
were undoubtedly contaminated, are visible 
on the surface of Mars. 

Any hitchhiking organisms exposed to 
the martian environment are killed in min- 
utes by the ultraviolet radiation from the Sun 
(3). Bacteria shielded inside the spacecraft 
would not be killed but would remain dor- 
mant because of the dry conditions. Over 
hundreds of thousands of years, they would 
be killed by galactic cos- 
mic radiation (4). 

In 2006, when the U.S. 
National Research Council 
emphasized the importance 
of special regions on Mars 
(5), it also recommended 
that National Aeronautics 
and Space Administration 
(NASA) work with COSPAR 
and other organizations to 
convene an international 
workshop that would focus 
on “ethical implications 
and the responsibility to 
explore Mars in a manner 
that minimizes the harmful 
impacts of those activities 
on potential indigenous biospheres.” This 
would include discussion of whether revisions 
to current planetary protection policies are 
needed and how to involve the public in dis- 
cussions about related ethical issues. This was 
the first official suggestion that consideration 
should be given to any indigenous life on 
Mars even if it is microbial (6). This workshop 
is planned for 2009. 

What do we do if we find life on Mars? It is 
possible that martian life is on the same tree of 
life as Earth life because of the exchange of 
meteorites between the two planets (7, 8). 
Alternatively, it may be that life on Mars rep- 
resents a second genesis—an independent ori- 
gin of life (9). Contamination by even one 
Earth bacterium may be a serious issue of 
environmental ethics. Furthermore, if we find 
evidence of a second genesis, then this may 
open discussions of warming Mars to help 
that alien life to flourish (/0, 17). Scientists 
and policy-makers who consider this choice 


International policies for protection of Mars and 
other planets from biological contamination 
need to be maintained and strengthened. 


will have to deal with any contamination left 
on Mars by previous explorers, so that it does 
not flourish instead. 

Sterilization of robotic spacecraft, while no 
longer policy, is at least possible. With human 
exploration, sterilization is not an option. Nor is 
it realistic to imagine that a human base could 
be so carefully engineered that it would release 
no microorganisms into the environment. 

The spacecraft that have landed on Mars 
have all been surface missions. Contaminants 
will remain local and static and can be re- 
moved without requiring an effort vastly 
larger than the missions that carried the con- 
tamination. Even at the crash sites, debris 
from Earth extends no more than a few meters 
into the surface. Reversing the contamination 
involves recovering the spacecraft parts and 
exposing any contaminated dirt to the steriliz- 
ing ultraviolet (UV) sunlight. However, if, for 
example, robotic or human explorers drill to 
investigate a subsurface aquifer, biologically 
reversible exploration would require rigorous 
sterilization of any components that go down 
the drill hole. Similarly, if human explorers 
establish bases inside caves (/2), the naturally 
sterilizing effect of the surface UV would be 
lost, and contamination would be persistent. 

We should not do anything now that would 
close off options for the future. I propose that 
COSPAR, in its upcoming discussions, set a 
policy that all Mars exploration be biologically 
reversible and that this policy extend to human 
exploration as well. 
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ASTRONOMY 


Forming Massive Stars 


Barbara Whitney 


he dominant physical pro- 
cesses contributing to the for- 
mation of a star are gravity, 
rotation, and magnetic fields acting 
on a cloud of dense gas and dust. 
Magnetic fields and pressure from the 
gas inhibit the collapse, but once a 
cloud is dense enough to overcome 
these forces, collapse begins and grav- 
ity and rotation dominate. A starlike 
core forms at the center and grows as 
mass is accreted from the infalling gas 
cloud. This is just classical mechanics, 
mathematically formulated by Isaac 
Newton more than 300 years ago. For 
stars with a mass similar to that of our 
Sun, a theory for the entire sequence 
from a dense molecular cloud core to 
a hydrogen-fusion star like our cur- 
rent Sun has been well developed and 
tested with observations (/). For stars 
of about 20 to 100 solar masses, how- 
ever, such a simple consideration ren- 
ders them unstable, and thus a more 
sophisticated treatment is required. 
On page 754 of this issue, Krumholz 
et al. (2) introduce a time-dependent 
model that explains the physical 
processes involved in the formation 
of such massive stars. 

For stars about 20 times as massive 
as our Sun, the outward pressure force 
exerted by photons on the grains of 
dust in the cloud must be considered 
(3). A simple calculation balancing 
gravity and the radiation force finds 
that this radiation pressure halts the 
infall onto the star once it reaches 20 solar 
masses. Yet, stars with masses as high as 120 
times that of our Sun have been observed (4). 
For most of the 20th century, the theoretical 
models of star formation assumed spherical 
geometries, which are one-dimensional. 
Adding rotation to the equations makes the 
problem two-dimensional and leads to forma- 
tion of a disk. These two-dimensional numer- 
ical models found that stars up to 40 solar 
masses could form from accretion through the 
disk, whereas the radiation pressure acts more 
on the polar regions (5). Now, it turns out that 
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A variety of stars. A 1.5 by 1.5 degree region on the sky (about 750 by 
750 light-years in size) in our Galactic plane. Red emission shows hot 
dust and gas; green emission shows emission from large molecules at the 
surface of molecular clouds (the neutral region between the ionized hot 
gas and cold molecular gas). The blue sources are normal stars like our 
Sun. The whitish, orange, and red sources are forming stars, most of 
them more than 10 times as massive as our Sun. The white arrows show 
regions of massive (and low-mass) star formation. The blue arrows show 
a region where star formation appears to be inhibited, probably due to 
strong magnetic fields and hot temperatures caused by colliding flows of 
gas and dust. The structures form a ringlike shape that may have been 
formed by a previous generation of massive stars with strong radiation 
fields and winds, or even a supernova explosion. The star formation 
(white arrows) may be triggered by the compression in the ring caused 
by the previous generation of massive stars. 


newly developed three-dimensional numeri- 
cal models allow another physical process into 
the equations, namely instabilities, which then 
allow even more massive stars to form. 
Krumholz et al. describe a three-dimen- 
sional simulation of the first 50,000 years of 
the collapse of a 100 solar mass cloud of 
molecular gas and dust in slow rotation before 
collapse. The slow rotation at large distances 
becomes fast rotation as the cloud collapses as 
aresult of conservation of angular momentum, 
and thus the cloud quickly collapses into a cen- 
tral “proto-star” and disk in about 4000 years. 
Because of the large pile-up of material onto 
the disk, two-armed spiral instabilities form 
after about 20,000 years; this transports angu- 
lar momentum outward, allowing more mate- 


PERSPECTIVE 


Sophisticated numerical simulations can now 
describe the formation of supermassive stars. 


rial to fall inward onto the star, 
increasing its mass further. After 
about 17,000 years, when the central 
object has reached about 17 solar 
masses, the outward radiation pres- 
sure begins to exceed gravitational 
force, but only in certain directions, 
mainly perpendicular to the disk 
direction, forming radiation bubbles. 
However, infalling material can still 
flow around the bubbles to reach the 
central object. The bubbles develop 
deformities, and dense fingers of 
material continue to fall in. These 
fingers are analogous to the well- 
studied Rayleigh-Taylor instabilities 
in classical fluid mechanics. The net 
effect is that more mass falls in than 
is pushed out by radiation pressure. 
The rate at which mass falls in is 
variable. In their simulation, the disk 
instabilities cause a second smaller 
star to form in the disk. At the end of 
the simulation, the masses of the stars 
were 33 and 47 solar masses, with the 
other 20 solar masses still in the disk 
and infalling envelope, and their 
future uncertain. 

In addition to overcoming previ- 
ous theoretical problems for forming 
massive stars, the simulation of 
Krumholz et al. reproduces an ob- 
served feature of massive stars, and 
that is the very common occurrence 
of binaries. The accretion variability 
predicted by the simulation caused 
by the instabilities should be testable 
by searching for variation in the observed 
fluxes of massive protostars. 

Understanding the formation of massive 
stars is a fundamental goal in astronomy. 
Massive stars form the heaviest elements from 
nuclear burning in their cores and in super- 
novae explosions after fuel in the core is 
exhausted. Their winds and luminosity have a 
profound effect on their surrounding environ- 
ments, churning up the gas and dust, and both 
stimulating and regulating new star forma- 
tion. Examples of the turbulent processes 
caused by the life cycle of massive star forma- 
tion (birth and death) can be seen in the figure. 

The simulation by Krumholz et al. took 
about 40 days of computing time on 256 
processors running simultaneously. This is 
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hopefully only the beginning of even more 
sophisticated simulations. Many intriguing 
open questions remain, such as, how does 
the central protostar evolve as it starts 
nuclear burning of hydrogen in its core 
while its surface continues to accrete from 
the disk and envelope? (A more compact, 
hotter star will ionize the surrounding gas 


and have a different effect on the environ- 
ment than a larger, cooler star of the same 
mass and luminosity.) At what point do the 
stellar winds and radiation disrupt the disk 
and envelope? Can the disk survive long 
enough to form planets, or is the realm of 
planet formation available only to low-mass 
stars like our Sun? 
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EVOLUTION 


Time's Stamp on Modern 


Biogeography 


J. Alistair Crame 


hy are there more types of plants 
and animals on some parts of the 
Earth’s surface than on others, and 


do these patterns point to the operation of 
some basic laws of nature that have yet to be 
fully understood? Over the past three decades, 
answers to these questions have shifted from 
essentially ecological to much more evolu- 
tionary (/). This shift largely results from the 
rapid growth in phylogenetic studies, which 
has led to a proliferation of time-calibrated 
evolutionary trees, and from the advent of 
large, analytical databases. Using the latter, it 
is now possible to assemble vast amounts of 
spatial and temporal data on the distribution of 
plants and animals to establish statistically 
significant biogeographic patterns. On page 
767 of this issue, Krug et al. use this approach 
to provide important new insights into how 
biodiversity evolved over the past 200 million 
years (2). 

There are two critical patterns in taxo- 
nomic diversity: the trend in the total number 
of taxa through geologic time, and the latitu- 
dinal gradient from the tropics to the poles. 
These two features are undoubtedly linked. 
If the number of species (or genera) has 
increased dramatically through time (espe- 
cially over the past 65 million years, referred 
to as the Cenozoic era), then it is almost cer- 
tain that latitudinal gradients must have steep- 
ened too, because the fossil record strongly 
suggests that many groups originated in tropi- 
cal and subtropical regions. However, if global 
diversity has been relatively constant for much 
of the past 300 million years, then the steep 
gradients that we see today can be regarded as 
essentially time-invariant features of Earth’s 
surface. Is the strong diversity contrast 
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A study of living bivalves provides clues to 
the evolution of biodiversity over the past 200 
million years. 


major new database survey of 
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Age-frequency distributions of marine bivalve genera. The curves 
plot the ages of genera, ranked from youngest to oldest, in the tropical 
West Pacific and Arctic bioprovinces. The vertical line marks the K-Pg 
boundary at 65 million years ago. The slopes of the two provinces differ 
slightly before the K-Pg boundary, but diverge strongly following the 
mass extinction, as many more new species evolved in the tropics than in 


the Arctic. [From fig. $3 in (3)] 


between the tropics and the poles really only a 
feature of the youngest part of the fossil 
record, or has it perhaps always been there? 
According to the conventional view of bio- 
diversity through time, biodiversity rose rap- 
idly from 542 to 445 million years ago, fol- 
lowed by a somewhat irregular plateau until 
about 250 million years ago. A steep fall at the 
Permian-Triassic mass extinction event was 
soon reversed, and the greater part of the 
past 250 million years was marked by a 
steep rise that was barely deflected by the 
Cretaceous—Paleogene (K-Pg) mass extinc- 
tion. The increase in the number of species 
through the past 65 million years alone could 
have been by an order of magnitude (3, 4). 
Our faith in this view was, however, 
severely tested when Alroy ef al. published a 


that the younger part of the 
Cenozoic record (i.¢., the last 
23 million years) is poorly 
sampled, and this is particu- 
larly so of the tropics (6). To 
counter this, Alroy et al. later 
quadrupled the size of their 
database and produced a new 
curve based on substantially 
better coverage, both strati- 
graphically and geographically, of the entire 
Cenozoic (7). This curve again shows only a 
very modest rise over the past 65 million 
years; in terms of the number of genera, the 
peak over the last 23 million years is only 1.74 
times as high as that of approximately 400 
million years ago. Furthermore, Alroy ef al. 
(7) found evidence of steep latitudinal gradi- 
ents as far back in time as 475 million years 
ago. Although these are shallower than those 
of the last 23 million years, the ratio of tropi- 
cal to temperate faunas seems to be very sim- 
ilar. These authors suggest that gradients must 
have developed through time by the parallel 
development of both tropical and temperate 
faunas, and not by tropical radiations alone. 
Krug et al. now provide a somewhat differ- 
ent perspective on the evolution of modern 
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biogeographic patterns. Their study, too, is 
centered on a large database, but in this case it 
is entirely of living organisms, the marine 
bivalves. Over 28,000 records of bivalve gen- 
era and subgenera from 322 locations around 
the world have now been compiled by these 
authors, giving a global record of some 854 
genera and subgenera and 5132 species. No 
fossils are included in the database, but 
because bivalves have a good fossil record, it is 
possible to estimate accurately the age of ori- 
gin of almost all extant genera. It is then possi- 
ble to plot a backward survivorship curve (8) 
for each of the 27 global bivalve provinces (9). 

On the basis of these curves, Krug ef al. find 
that origination rates of marine bivalves in- 


creased significantly almost everywhere im- 
mediately after the K-Pg mass extinction event. 
The highest K-Pg origination rates all occurred 
in tropical and warm-temperate regions. A dis- 
tinct pulse of bivalve diversification in the early 
Cenozoic was concentrated mainly in tropical 
and subtropical regions (see the figure). 

The steepest part of the global backward 
survivorship curve for bivalves lies between 65 
and 50 million years ago, pointing to a major 
biodiversification event in the Paleogene (65 to 
23 million years ago) that is perhaps not yet 
captured in Alroy et al.’s database (5, 7). The 
jury is still out on what may have caused this 
event. But we should not lose sight of the fact 
that the steep rise to prominence of many mod- 
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ern floral and faunal groups in the Cenozoic 
may bear no simple relationship to climate or 
any other type of environmental change (/0, //). 
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e live life in the network. We check 

our e-mails regularly, make mobile 

phone calls from almost any loca- 
tion, swipe transit cards to use public trans- 
portation, and make purchases with credit 
cards. Our movements in public places may be 
captured by video cameras, and our medical 
records stored as digital files. We may post blog 
entries accessible to anyone, or maintain friend- 
ships through online social networks. Each of 
these transactions leaves digital traces that can 
be compiled into comprehensive pictures of 
both individual and group behavior, with the 
potential to transform our understanding of our 
lives, organizations, and societies. 

The capacity to collect and analyze massive 
amounts of data has transformed such fields as 
biology and physics. But the emergence of a 
data-driven “computational social science” has 
been much slower. Leading journals in eco- 
nomics, sociology, and political science show 
little evidence of this field. But computational 
social science is occurring—in Internet compa- 
nies such as Google and Yahoo, and in govern- 
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ment agencies such as the U.S. National Secur- 
ity Agency. Computational social science could 
become the exclusive domain of private com- 
panies and government agencies. Alternatively, 
there might emerge a privileged set of aca- 
demic researchers presiding over private data 
from which they produce papers that cannot be 


A field is emerging that leverages the 
capacity to collect and analyze data at a 
scale that may reveal patterns of individual 
and group behaviors. 


critiqued or replicated. Neither scenario will 
serve the long-term public interest of accumu- 
lating, verifying, and disseminating knowledge. 

What value might a computational social 
science—based in an open academic environ- 
ment—offer society, by enhancing understand- 
ing of individuals and collectives? What are the 


Data from the blogosphere. Shown is a link structure within a community of political blogs (from 2004), 
where red nodes indicate conservative blogs, and blue liberal. Orange links go from liberal to conservative, 
and purple ones from conservative to liberal. The size of each blog reflects the number of other blogs that 
link to it. [Reproduced from (8) with permission from the Association for Computing Machinery] 
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obstacles that prevent the emergence of a com- 
putational social science? 

To date, research on human interactions has 
relied mainly on one-time, self-reported data 
on relationships. New technologies, such as 
video surveillance (/), e-mail, and “smart” 
name badges, offer a moment-by-moment pic- 
ture of interactions over extended periods of 
time, providing information about both the 
structure and content of relationships. For 
example, group interactions could be examined 
through e-mail data, and questions about the 
temporal dynamics of human communications 
could be addressed: Do work groups reach a 
stasis with little change, or do they dramatically 
change over time (2)? What interaction patterns 
predict highly productive groups and individu- 
als? Can the diversity of news and content we 
receive predict our power or performance (3)? 
Face-to-face group interactions could be 
assessed over time with “sociometers.” Such 
electronic devices could be worn to capture 
physical proximity, location, movement, and 
other facets of individual behavior and collec- 
tive interactions. The data could raise interest- 
ing questions about, for example, patterns of 
proximity and communication within an 
organization, and flow patterns associated with 
high individual and group performance (4). 

We can also learn what a “macro” social 
network of society looks like (5), and how it 
evolves over time. Phone companies have 
records of call patterns among their custom- 
ers extending over multiple years, and e- 
Commerce portals such as Google and Yahoo 
collect instant messaging data on global com- 
munication. Do these data paint a compre- 
hensive picture of societal-level communica- 
tion patterns? In what ways do these interac- 
tions affect economic productivity or public 
health? It is also increasingly easy to track the 
movements of people (6). Mobile phones 
allow the large-scale tracing of people’s 
movements and physical proximities over 
time (7). Such data may provide useful epi- 
demiological insights: How might a patho- 
gen, such as influenza, driven by physical 
proximity, spread through a population? 

The Internet offers an entirely different 
channel for understanding what people are say- 
ing, and how they are connecting (8). Consider, 
for example, this past political season, tracing 
the spread of arguments, rumors, or positions 
about political and other issues in the blogo- 
sphere (9), as well as the behavior of individu- 
als “surfing” the Internet (/0), where the con- 
cerns of an electorate become visible in the 
searches they conduct. Virtual worlds, which 
by their nature capture a complete record of 
individual behavior, offer ample opportunities 
for research—experimentation that would 


otherwise be impossible or unacceptable (//). 
Similarly, social network Web sites offer a 
unique opportunity to understand the impact of 
aperson’s position in the network on everything 
from their tastes to their moods to their health 
(12), whereas Natural Language Processing 
offers increased capacity to organize and ana- 
lyze the vast amounts of text from the Internet 
and other sources (/3). 

In short, a computational social science is 
emerging that leverages the capacity to collect 
and analyze data with an unprecedented 
breadth and depth and scale. Substantial barri- 
ers, however, might limit progress. Existing 
ways of conceiving human behavior were 
developed without access to terabytes of data 
describing minute-by-minute interactions and 
locations of entire populations of individuals. 
For example, what does existing sociological 
network theory, built mostly on a foundation of 
one-time “snapshot” data, typically with only 
dozens of people, tell us about massively lon- 
gitudinal data sets of millions of people, 
including location, financial transactions, 
and communications? These vast, emerging 
data sets on how people interact surely offer 
qualitatively new perspectives on collective 
human behavior, but our current paradigms 
may not be receptive. 

There are also enormous institutional obsta- 
cles to advancing a computational social sci- 
ence. In terms of approach, the subjects of 
inquiry in physics and biology present different 
challenges to observation and intervention. 
Quarks and cells neither mind when we dis- 
cover their secrets nor protest if we alter their 
environments during the discovery process. As 
for infrastructure, the leap from social science 
to a computational social science is larger than 
from biology to a computational biology, 
largely due to the requirements of distributed 
monitoring, permission seeking, and encryp- 
tion. There are fewer resources available in the 
social sciences, and even the physical (and 
administrative) distance between social science 
departments and engineering or computer sci- 
ence departments tends to be greater than for 
the other sciences. 

Perhaps the thorniest challenges exist on the 
data side, with respect to access and privacy. 
Much of these data are proprietary (e.g., mobile 
phone and financial transactional information). 
The debacle following AOL public release of 
“anonymized” search records of many of its 
customers highlights the potential risk to indi- 
viduals and corporations in the sharing of per- 
sonal data by private companies (/4). Robust 
models of collaboration and data sharing 
between industry and academia are needed to 
facilitate research and safeguard consumer pri- 
vacy and provide liability protection for corpo- 


rations. More generally, properly managing pri- 
vacy issues is essential. As the recent U.S. 
National Research Council’s report on geo- 
graphical information system data highlights, it 
is often possible to pull individual profiles out 
of even carefully anonymized data (/5). Last 
year, the U.S. National Institutes of Health and 
the Wellcome Trust abruptly removed a num- 
ber of genetic databases from online access 
(16). These databases were seemingly anony- 
mized, simply reporting the aggregate fre- 
quency of particular genetic markers. How- 
ever, research revealed the potential for de- 
anonymization, based on the statistical power 
of the sheer quantity of data collected from 
each individual in the database (/7). 

Because a single dramatic incident involv- 
ing a breach of privacy could produce rules 
and statutes that stifle the nascent field of com- 
putational social science, a self-regulatory 
regime of procedures, technologies, and rules 
is needed that reduces this risk but preserves 
research potential. As a cornerstone of such a 
self-regulatory regime, U.S. Institutional Re- 
view Boards (IRBs) must increase their techni- 
cal knowledge to understand the potential for 
intrusion and individual harm because new 
possibilities do not fit their current paradigms 
for harm. Many IRBs would be poorly equip- 
ped to evaluate the possibility that complex 
data could be de-anonymized. Further, it may 
be necessary for IRBs to oversee the creation 
of a secure, centralized data infrastructure. 
Currently, existing data sets are scattered 
among many groups, with uneven skills and 
understanding of data security and widely 
varying protocols. Researchers themselves 
must develop technologies that protect privacy 
while preserving data essential for research. 
These systems, in turn, may prove useful for 
industry in managing customer privacy and 
data security (/8). 

Finally, the emergence of a computational 
social science shares with other nascent inter- 
disciplinary fields (e.g., sustainability science) 
the need to develop a paradigm for training 
new scholars. Tenure committees and editorial 
boards need to understand and reward the effort 
to publish across disciplines. Initially, computa- 
tional social science needs to be the work of 
teams of social and computer scientists. In the 
long run, the question will be whether acade- 
mia should nurture computational social scien- 
tists, or teams of computationally literate social 
scientists and socially literate computer scien- 
tists. The emergence of cognitive science offers 
a powerful model for the development of 
a computational social science. Cognitive sci- 
ence has involved fields ranging from neuro- 
biology to philosophy to computer science. It 
has attracted the investment of substantial 
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resources to create a common field, and created 
enormous progress for public good in the last 
generation. We would argue that a computa- 
tional social science has a similar potential, and 
is worthy of similar investments. 
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CELL. BIOLOGY 


Moonlighting in Mitochondria 


Martin G. Myers Jr. 


olecules known as signal transducers 

and activators of transcription (STATs) 

egulate gene expression in the 

nucleus in response to cell surface receptors 

that are activated by cytokines. On page 793 of 

this issue, Wegrzyn et al. (/) reveal that the 

isoform Stat3 also functions in another 

organelle—the mitochondria—to control cell 

respiration and metabolism. This finding not 

only reveals a new role for Stat3, but implies its 

potential role in linking cellular signaling path- 
ways to energy production. 

Stat3 proteins represent the canonical medi- 
ators of signals elicited by type I cytokine 
receptors at the cell surface (2). For instance, 
the adipocytokine leptin activates Stat3 in 
hypothalamic neurons to promote the expres- 
sion of the catabolic neuropeptide, propio- 
melanocortin, thereby regulating whole-body 
energy intake and metabolism (3). The binding 
of a cytokine to its receptor triggers an intra- 
cellular cascade of events, beginning with the 
activation of an enzyme, Jak kinase, which is 
associated with the receptor’s cytoplasmic 
domain. The activated receptor-Jak complex 
then recruits and phosphorylates a tyrosine 
residue in cognate STAT proteins. This modifi- 
cation causes the STAT protein to relocate to 
the nucleus, where, as a dimer, it binds to spe- 
cific DNA sequences and promotes gene 
expression (see the figure). Thus, the well- 
understood job of STAT proteins is to transmit 
a transcriptional signal from the cell surface to 
the nucleus. The phosphorylation of some 
STAT proteins ona specific serine residue may 
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also contribute to their regulation (2). 

Wegrzyn et al. have now identified another 
crucial role for Stat3, the isoform that responds 
to cytokines of the interleukin-6 and -10 fami- 
lies (including leptin). These cytokines act in 
the immune system and many other organ sys- 
tems to regulate diverse cellular processes, 
including differentiation, proliferation, and 


Cytokine 


Plasma membrane receptor 


Other 2 Serine 


stimuli ~ kinases 


Mitochondrion 


A cellular signaling pathway that responds 
to cytokines may coordinately control energy 
production by mitochondria. 


apoptosis (2). Noting that GRIM-19, a mito- 
chondrial protein, interacts with Stat3 and 
inhibits Stat3 transcriptional activity (4-7), the 
authors investigated the potential mitochondrial 
location of Stat3, revealing that a fraction of 
cellular Stat3 resides within the mitochondria 
of mouse myocytes and hepatocytes. Here, 
Stat3 associates with GRIM-19-containing 


Cytokine 


Nucleus 


Dual deployment. The activation of a cytokine receptor at the cell surface promotes the tyrosine phospho- 
rylation (Tyr-P) of Stat3, which dimerizes and moves to the nucleus to control gene expression. Serine phos- 
phorylation (Ser-P) of Stat3 appears to be required for its action in mitochondria, where it promotes 
increased oxidative phosphorylation. Because many stimuli promote the serine phosphorylation of Stat3, 
many signaling pathways could regulate mitochondrial respiration via Stat3. 
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complexes I and II, which are components of 
the electron transport chain that generates 
energy by oxidative phosphorylation. 

Wegrzyn et al. determined that in a subset of 
mouse B lymphocytes devoid of Stat3, oxida- 
tive phosphorylation was reduced, a defect 
attributable to the diminished activity of com- 
plexes I and II. That the number of mitochon- 
dria and their content of the proteins that consti- 
tute complexes I and II were not altered in 
Stat3-null cells suggests that Stat3 regulates the 
activity, as opposed to the absolute amount (as 
would be expected for transcriptional regula- 
tion), of complexes I and II. 

The expression of Stat3 in these otherwise 
Stat3-null cells restored oxidative phosphoryl- 
ation, and this rescue of mitochondrial func- 
tion did not require the DNA binding domain, 
the dimerization motif, or the tyrosine phos- 
phorylation site that controls Stat3 nuclear 
localization and transcriptional activity, con- 
sistent with a transcription-independent role 
for Stat3. By contrast, the conserved serine 
phosphorylation site on Stat3 was important: 
Expression of Stat3 with a mutation that pre- 
vented phosphorylation of this serine did not 


produce the rescue effect, whereas a phos- 
phomimetic mutant at this site did. It remains 
to be determined whether Stat3 is unique 
among STAT proteins in localizing to and 
regulating mitochondria. 

The nontranscriptional function of Stat3 in 
mitochondria raises many questions about its 
precise role in the organelle. For instance, 
although the results of Wegrzyn et al. reveal the 
absolute requirement for Stat3 to maintain nor- 
mal mitochondrial function, they do not speak 
directly to the potential role for Stat3 in the 
physiologic regulation of cellular respiration. 
Presumably, specific signaling pathways, such 
as those that regulate the serine phosphoryl- 
ation of Stat3, modulate Stat3-dependent con- 
trol of cellular respiration (see the figure). 
Whether the control of mitochondrial localiza- 
tion, complex I/II association, or some other 
step might underlie such regulation remains 
unclear, however. 

Although it is tempting to speculate that 
cytokines use Stat3 to coordinately regulate 
transcription and respiration, the inhibition of 
Stat3-dependent transcription by GRIM-19 
suggests that the opposite—Stat3 performs 


each job at the expense of the other—is just as 
likely. Also, because many cytokine-indepen- 
dent intracellular signaling proteins (such 
as protein kinase C isoforms and mitogen- 
activated protein kinases) promote the serine 
phosphorylation of Stat3 (2), cytokines may 
not be the only, or even the major, controllers of 
Stat3-modulated oxidative phosphorylation. 

Although many questions will take substan- 
tial research to work out, the newly discovered 
mitochondrial function for Stat3 has the poten- 
tial to dramatically expand the way we think 
about the roles of STAT proteins, as well as how 
canonical cellular signaling pathways may con- 
trol cell energetics. 
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GEOLOGY 


Heavy Metals or Punk Rocks? 


Robert J. Bodnar 


t the recent World Copper Congress 
A Santiago, Chile, Rio Tinto’s chief 

executive for copper, Bret Clayton, 
reported that copper consumption is expected 
to double over the next two decades (/); 
demand for other metals is expected to paral- 
lel this trend. These projected metal needs 
cannot be satisfied with known ore bodies. To 
locate new deposits, minerals exploration pro- 
grams require robust genetic models for the 
formation of economic accumulations of 
metals. On page 764 of this issue, Wilkinson 
et al. (2) elucidate one of the least understood 
aspects of ore formation: the concentration 
of metals in hydrothermal solutions that 
deposited the ores. 

Fluid inclusions provide the best available 
tool for determining the physical and chemi- 
cal conditions during ore formation (3). 
These microscopic samples (~5 to 50 um in 
diameter) of the ore-forming fluid are 
trapped in ore and non-ore minerals during 
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mineralization; they thus record the tempera- 
ture, pressure, and composition of the ore- 
forming fluid. Advanced microanalytical 
techniques allow individual fluid inclusions 
to be analyzed to determine ore metal con- 
centrations (4—6). 

Wilkinson et al. now report unusually high 
metal contents of hydrothermal fluids from 
two ore districts containing sediment-hosted 
zinc-lead deposits, based on microanalysis of 
fluid inclusions. Other recent studies of fluid 
inclusions from copper (7) and gold (8) 
deposits also found much higher metal con- 
centrations than would have been predicted on 
the basis of experimental data or theoretical 
models. If confirmed by further studies, these 
results have important implications for both 
the duration of the ore-forming process and 
the amounts of ore fluids needed to generate 
world-class ore deposits. This information is 
of crucial importance for understanding ore 
genesis, given that the duration of ore-forming 
systems is one of the major unknowns related 
to the formation of mineral deposits (9). 

Average continental crust contains about 
70 ppm zine (Zn) and 12.5 ppm lead (Pb). In 


Meeting the world’s demand for metals such as 
copper and zinc may be aided by recent 
insights into the formation of ore deposits. 


contrast, average ore grades in Mississippi 
Valley-type (MVT) Zn-Pb deposits similar to 
those studied by Wilkinson ef al. typically are 
about 6% Zn and 2% Pb by weight, represent- 
ing enrichment factors of about 850 and 1600, 
respectively. Thus, metals must be scavenged 
from a large volume of rock with average 
crustal metal values and concentrated into a 
much smaller rock volume to generate eco- 
nomic deposits. For example, large to giant 
MVT deposits contain on the order of 10° to 
2 x 10’ metric tons combined Zn + Pb (/0), 
with an average Zn:Pb ratio of ~3. 

Garven (//) modeled the fluid-flow his- 
tory associated with formation of a large 
MVT Zn-Pb deposit in Pine Point, Canada, 
and concluded that the total hydrothermal 
fluid discharge through the mineralized area 
was 5 x 10° m? year !. Garven assumed that 5 
mg of zinc precipitated per kilogram of solu- 
tion that flowed through these rocks and con- 
cluded that it would take from 0.5 to 5.0 mil- 
lion years to form the deposits, with Darcy 
flow rates (12) of 1 to S m year. Similar dura- 
tions for ore formation (0.3 million years) 
have been estimated for the MVT deposits in 
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operate to promote the precipitation 
of all (or most) of the dissolved met- 
als transported by the hydrothermal 
fluids. The results presented by 
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the Upper Mississippi Valley district of the 
United States (/3). 

Flow rates and duration of the ore-form- 
ing process reported by Garven (//) require 
total hydrothermal fluid volumes ranging 
from 2500 to 25,000 km? over the lifetime of 
the ore-forming system. Similar volumes of 
fluid would be required to form other large to 
giant MVT deposits if each kilogram of fluid 
only precipitates a few milligrams of metal. 
However, if the metal content of the ore- 
forming fluid is considerably higher, as sug- 
gested by Wilkinson ef al., then both the 
amount of fluid required and the duration of 
the ore-forming event would be reduced by 
orders of magnitude (see the figure). For 
example, if each kilogram of hydrothermal 
fluid deposited 10° mg of Zn (orange dot in 
the figure), then Pine Point and similar 
deposits could have formed in about 10+ 
years from a few cubic kilometers of 
hydrothermal fluid, compared to the millions 
of years and hundreds of cubic kilometers of 
fluid required assuming that each kilogram 
of hydrothermal fluid deposited 5 mg of Zn 
(green dot in the figure). 

The results presented by Wilkinson et al. 
further highlight the importance of deposi- 
tional processes in the formation of economic 
occurrences of metals. Most ore geologists 
now agree that fluids with metal contents suf- 
ficient to produce economic mineralization 
are relatively common (/4), and that it is the 
lack of a suitable depositional mechanism 
that often limits ore formation. Temperature 
decrease alone cannot be the dominant mech- 
anism, because the solubility of most metals 
in most hydrothermal fluids decreases by only 
a small amount over the temperature range 
determined for most deposits. Thus, other 
processes—such as boiling or immiscibility, 
fluid mixing, or fluid-rock interactions—must 


Wilkinson ef al. provide important 
new insights into metal contents of 
ore-forming fluids and emphasize 
the need for continued research to constrain 
the amounts of hydrothermal fluids required 
to form world-class ore deposits and the dura- 
tion of the ore-forming events. 
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How to form an ore deposit. This modified “Roedder diagram” (15) shows the relationship 
between the amount of metal precipitated per unit mass of hydrothermal fluid (y axis) and the size 
of the ore deposit (x axis). The time (black diagonals) and volume of fluid (red dashed diagonals) 
required to form the deposit are contoured onto these coordinates. The width of the shaded box rep- 
resents the range in ore tonnage for large to giant MVT Zn-Pb deposits (20). Wilkinson et al. report 
zinc concentrations of 5000 ppm and 3000 ppm (dashed lines near the top of the box). These con- 
centrations are higher than previously reported and suggest that economic deposits can form faster 
than previously suggested (green and orange dots). 
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Confined Polymers Crystallize 


Piet J. Lemstra 


Squeezing very thin polymer layers can cause them to form polymer single crystals that could 


make plastic films less permeable to gases. 


lastics have been very successful in 
Prosi glass, metals, and wood, in 

part because they are light and easy to 
process into complex shapes at high speed and 
at low cost. However, in applications such as 
packaging, molded plastics can be at a disad- 
vantage compared with steel, aluminum, and 
glass because of their relatively high perme- 
ability to atmospheric gases such as O, and 
CO,. This problem arises because the syn- 
thetic polymers that are the main component 
of plastics are rather randomly organized in 
the solid state with sufficient spaces between 
the molecules that allow for gas diffusion. 
Although the problem can be solved to some 
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extent by adding less-permeable materials to 
plastics, ideally it would be desirable to find a 
way to arrange the long-chain polymer mole- 
cules in an orderly way, namely, into crystal- 
lites in which the molecules are closely 
packed. On page 757 of this issue, Wang 
et al. (1) report that very thin layers of a com- 
monly used polymer crystallize through spe- 
cial processing conditions into so-called poly- 
mer single crystals, which is surprising given 
the known difficulties in getting polymers to 
form crystals. 

The focus of most polymer research is on 
functional properties in emerging areas such 
as biomedical engineering (2), electronics (3), 
and energy [for example, plastic solar cells 
(4)], but the vast bulk of synthetic polymers 
are used in plastics. Packaging materials 
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consume about 40% of the plastics produced. 
Plastics have surpassed steel in terms of pro- 
duction volume, with about 250 million tons 
of plastics produced annually worldwide. 

Despite this widespread use, there are 
packaging applications where the gas perme- 
ablility of plastics, especially to oxygen, ham- 
pers their use, from beer bottles to coatings in 
advanced electronics. Plastics are quite per- 
meable because their softening temperatures 
(the glass transition temperature, 7,) are low 
in comparison with those of silicate glass and 
much lower than the melt temperatures of 
metals. For commonly used plastics, 7 
ranges from —100° to about 100°C, whereas 
for silicate glasses, T. . ranges from 500° to 
above 1000°C. 

Gases or liquids permeate a plastic or glass 
by first dissolving in it and then diffusing 
through it. In the simplest theoretical descrip- 
tion, the permeability is the product of the 
gas’s solubility and its diffusivity. At room 
temperature, the diffusivity—the movement 


Packing polymer chains. Polymer single crystals can be obtained from very 
dilute solutions, e.g. polyethylene single crystals as visualized by AFM (A) in 
which the chains are folded (B) (9). Wang et al. have now succeeded in creating 


of the penetrating molecules—in silicate glass 
is almost zero because the silicate chains are 
“frozen” at ambient temperatures far below 
their T,. The packing of these chains is not 
orderly but is still very tight and provides very 
little empty space and mobility for the pene- 
trating molecules to pass through. 

In contrast, the segments making up the 
polymer chains are quite mobile at room tem- 
perature. This movement opens up pathways 
for low—molar mass molecules such as O, and 
CO, and, in the case of polar polymers such as 
nylon, for water. In general, the higher the 7. ‘ 
of a plastic, the better its barrier properties. 
For that reason, the polyester poly(ethylene 
terephthalate) (PET), with a T, of ~ 80°C, can 
be used for carbonated sodas, but it is too per- 
meable to oxygen for storing beer. 

Various options exist to improve the barrier 
properties of plastic systems, such as applying 
a metal or ceramic coating or processing alter- 
nating layers of the plastic film with deposited 
ceramic film. More recently, nanoscale clay 
particles (made from exfoliated single layers 


of clay) have been dispersed in a plastic film 
(5). These thin clay layers are impermeable to 
gases and create a tortuous path for diffusing 
molecules; the path length for diffusion in- 
creases and slows gas exchange. 

Another approach, but one that is much 
more challenging, is to enhance the barrier 
properties of polymers by getting them to crys- 
tallize because polymer crystals are imperme- 
able to gases. However, the long chains of syn- 
thetic polymer molecules are entangled with 
each other, much like cooked spaghetti. How 
can these long-chain and highly entangled 
molecules form ordered crystallites? 

The first efforts at polymer crystallization 
avoided highly entangled melts and started 
with dilute solutions. In the 1950s, Keller (6), 
Fischer (7), and Till (8) found independently 
that linear polyethylene (PE) can form 
platelet single crystals upon cooling dilute PE 
solutions (see the figure, panel A). The long- 
chain molecules are folded in these crystals 
(9). The fold length, which corresponds to the 
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crystal thickness, is very small, on the order 
of 10 to 20 nm (see the figure, panel B). The 
concept of folded-chain crystallization con- 
tinues to raise enormous interest among poly- 
mer physicists, because it is found that all 
polymer-chain molecules with a regular 
chemical structure form these folded-chain 
lamellar crystals. 

However, well-defined folded-chain crys- 
tals, referred to as lamellae, can only be grown 
from dilute solutions, which is too slow and 
inefficient for manufacturing. In processed 
products such as films, containers, and fibers, 
the polymers that are cooled from the molten 
state are only partly crystalline. Although 
there is still a driving force for the long-chain 
molecules to form folded-chain crystals, this 
process is hampered because the chain mole- 
cules are highly entangled polymers. 

Crystallization from the polymer melt 
starts from nuclei (catalyst residues or pur- 
posely added nucleation agents), and chain- 
folded crystallites emanate from such nuclei. 
However, the entangled polymer chains only 


partly incorporate into the crystallites, creating 
spherical crystal aggregates called spherulites. 
Thus, melt-crystallized polymers are semi- 
crystalline because those chain segments that 
are not folded into crystallites remain amor- 
phous. Partial crystallization improves the bar- 
rier properties to some extent, because the 
crystallites are not permeable and create a tor- 
tuous path for small molecules, just as in the 
case of the nanoclay platelets. 

Wang et al. now show how poly(ethylene 
oxide) (PEO)—a crystallizable polymer— 
can be processed from the melt to form very 
thin layers consisting of polymer single crys- 
tals. The authors use a process in which alter- 
nating layers of PEO and EAA poly(ethylene- 
co-acrylic acid) are coextruded. The pres- 
sures applied in this process force the long- 
chain molecules of PEO into their most com- 
pact arrangement, that of large folded-chain 
lamellar crystals (see the figure, panels C and 
D), while the EAA layers remain amorphous. 
The permeability of PEO to oxygen decreased 
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crystallized polymer layers (C) from the melt by extruding alternating thin layers 
of two different polymers. (D)These single-crystal layers inhibit gas diffusion by 
creating barriers that make the path that molecules take more tortuous. 


by about two orders of magnitude in this crys- 
talline form. 

Showing that large lamellar crystals can be 
grown from the melt is not only of high aca- 
demic interest. The multilayer extrusion tech- 
nique is used to create higher-value products 
through processing [for example, flexible mir- 
rors (/(0)]. Crystalline polymers created in this 
way may in future function as barrier layers for 
semicrystalline polymers used in packaging. 
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INTRODUCTION 


Mr. Darwin 


THE DIVERSIFICATION AND SPECIATION OF LIVING ORGANISMS ARE THE BROAD 
theme for this special section, continuing our celebration of Charles Darwin’s 
200th birthday. The five Reviews in this section present multiple views on 
research on diversification at scales ranging from the macroevolutionary to the 
molecular. Benton (p. 728) examines the extents to which biotic and abiotic fac- 
tors have shaped species diversity in the fossil record. Gavrilets and Losos (p. 732) 
use theoretical predictions and empirical data to identify general patterns in the 
temporal, spatial, and genetic/morphological properties of adaptive radiation. 
Schluter (p. 737) reviews how research on ecological speciation has shifted in 
focus from morphological evolution to reproductive isolation, tracing the links 
between Darwin’s ideas and current thinking. Fraser et al. (p. 741) discuss the con- 
tentious area of microbial species formation. Finally, using examples from studies 
of genes and mutations involved in evolutionary change, Stern and Orgogozo 
(p. 746) illustrate how developmental biology and evolutionary theory might 
combine to reveal new predictive principles of genetic evolution. 

The special section is accompanied by evolutionary coverage in all other sec- 
tions of the magazine. The News section features the second in our series of 
monthly “origins” essays, on the origins of art and symbolic behavior. The Com- 
mentary section includes reviews of new evolutionary books. Science Careers 
carries a feature on researchers in the museum world, who play a vital role in 
evolutionary research. In Reports, Krug ef al. (p. 767) reveal the legacy of the 
end-Cretaceous mass extinction for the subsequent diversification of bivalves 
[see the accompanying Perspective by Crame (p. 720)]. With a focus on conser- 
vation, Carnaval et al. (p. 785) model spatially explicit evolutionary processes 
in endemic tree-frog species in the Brazilian Atlantic Forest (the biodiversity 
hotspot that inspired Darwin during his South American landfall). Forbes et al. 
(p. 776) reveal how a recent host shift of the fly Rhagoletis pomonella, a model 
for sympatric speciation, has led to incipient speciation in a parasitoid wasp that 
attacks the fly. Rowland and Emlen (p. 773) show facultative male trimorphism 
in dung beetles, a hitherto unsuspected level of intraspecific variation. Tang and 
Presgraves (p. 779) report a cellular and molecular mechanism of hybrid steril- 
ity, which has a key role in speciation in Drosophila. And in Science Express, 
there is an echo of another of Darwin’s central interests: Anderson et al. show 
that a melanism mutation has been selected for in gray wolves, most likely after 
a hybridization event with domestic dogs. 

This is a sample, not a survey. Research on speciation and diversification is 
itself expanding and diversifying. Evolutionary topics have been covered more fre- 
quently in Science in the first decade of the 21st century than in any previous one 
(and an order of magnitude more than in the years of the Modern Synthesis in the 
mid-20th century). This reflects not only the continuing efforts to understand and 
document the selective forces leading to speciation, but also how genetic research 
is homing in on the molecular and cellular mechanisms that enable diversification 
to occur. Darwin, we hope, would be thrilled. 

— ANDREW SUGDEN, CAROLINE ASH, BROOKS HANSON, LAURA ZAHN 
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REVIEW 


The Red Queen and the Court Jester: 
Species Diversity and the Role of Biotic 
and Abiotic Factors Through Time 


Michael J. Benton 


Evolution may be dominated by biotic factors, as in the Red Queen model, or abiotic factors, 

as in the Court Jester model, or a mixture of both. The two models appear to operate 
predominantly over different geographic and temporal scales: Competition, predation, and other 
biotic factors shape ecosystems locally and over short time spans, but extrinsic factors such as 
climate and oceanographic and tectonic events shape larger-scale patterns regionally and 
globally, and through thousands and millions of years. Paleobiological studies suggest that species 
diversity is driven largely by abiotic factors such as climate, landscape, or food supply, and 
comparative phylogenetic approaches offer new insights into clade dynamics. 


here are two ways of viewing evolution, 

through the spectacles of either the Red 

Queen or the Court Jester. The Red Queen 
model (/) stems from Darwin, who viewed evo- 
lution as primarily a balance of biotic pressures, 
most notably competition, and it was character- 
ized by the Red Queen’s statement to Alice in 
Through the Looking-Glass that “it takes all the 
running you can do, to keep in the same place.” 
The Court Jester model (2) is that evolution, 
speciation, and extinction rarely happen except in 
response to unpredictable changes in the physical 
environment, recalling the capricious behavior 
of the licensed fool of Medieval times. Neither 
model was proposed as exclusive, and both 
Darwin and Van Valen (/) allowed for extrinsic 
influences on evolution in their primarily biotic, 
Red Queen views. 

Species diversity in a Red Queen world de- 
pends primarily on intrinsic factors, such as body 
size, breadth of physiological tolerance, or adapt- 
ability to hard times. In a Court Jester world, spe- 
cies diversity depends on fluctuations in climate, 
landscape, and food supply. In reality, of course, 
both aspects might prevail in different ways and at 
different times, what could perhaps be called the 
multilevel mixed model. Traditionally, biologists 
have tended to think in a Red Queen, Darwinian, 
intrinsic, biotic factors way, and geologists in a 
Court Jester, extrinsic, physical factors way. 

Much of the divergence between the Red 
Queen and Court Jester world views may depend 
on scale (2) (Fig. 1): Biotic interactions drive 
much of the local-scale success or failure of in- 
dividuals, populations, and species (Red Queen), 
but perhaps these processes are overwhelmed by 
substantial tectonic and climatic processes at time 
scales above 10° years (Court Jester). It is im- 
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portant not to export organism-level processes to 
regional or global scales, and it is likely that 
evolution operates in a pluralistic way (3). 

There are two broad methodologies for studies 
of species diversity through time, taxic and phy- 
logenetic (4). The taxic approach involves treat- 
ing species, genera, or families as independent 
entities and counting their occurrences against 
time and other factors. The phylogenetic ap- 
proach uses cladograms or molecular trees to 


Fig. 1. Operation of Red Queen 


disentangle key aspects of clade histories. Clades 
are monophyletic, including all descendants of an 
ancestor, whereas taxa may be monophyletic or 
paraphyletic, excluding some descendants of the 
ancestor. Comparative macroecological studies 
add rigor to analyses showing that sister clades 
may vary in rate of evolution, timing of increases 
in species richness and morphospace occupation, 
and distributions of evolutionary novelties across 
lineages and subclades. Here, I will explore the 
largest-scale global, taxic investigations, provide 
an outline of how these and other studies corre- 
spond to the predictions of the Red Queen, Court 
Jester, and multilevel mixed models (Table 1), and 
outline some phylogenetic studies of the macro- 
evolution of species diversity. 


The Global Pattern of Diversification 
Through Time 


A key question about the origin of modern bio- 
diversity is how today’s 10 million species arose 
from a single ultimate species of microbial life 
3500 million years ago (Ma) (Fig. 1). Two mod- 
els for global diversification are termed the 
saturation/equilibrium model (5—7) and the ex- 
pansion model (8—//). The equilibrium model has 
prevailed, among marine paleobiologists at least, 
for a long time, and represents a classic Red Queen 
viewpoint because it implies primarily biotic con- 
trols (density dependence) on global diversity. 
There are two versions of the equilibrium 
model, differing in the time when the global 


(biotic causation) and Court Jester A 

(abiotic causation) models at differ- rane 

ent geographic and temporal scales ® (3-6 My) 

(A). The Red Queen may prevail at 8 

organismic and species level on short Species 

time scales, whereas the Court Jester o life span 
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likely the result of spatial averaging of 
regional responses to climate change 
and other complex physical pertur- 
bations that may be in opposite di- 
rections, and so cancel each other, 
suggesting no controlling effect of the 
physical environment on evolution. B 
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marine ecosystem became saturated. 
Sepkoski’s coupled logistic model (5) = 
identified three equilibria, in the Cam- 3 
brian, most of the Paleozoic, and per- 
haps a third, beginning in the Pliocene 
and continuing to the present (Fig. 
2A). These three equilibrium levels 
correspond to three sets of phyla, the 
Cambrian, Paleozoic, and Modern, that 
interacted and successively replaced 
each other through the Cambrian- 
Ordovician and Permo-Triassic inter- 
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vals, reaching higher equilibrium levels 
after each long-term replacement event. 
The second model (6, 7) identifies a 
single equilibrium level from the early 
Paleozoic, perhaps 400 Ma, to the 
present (Fig. 2B). In both models, the 
equilibria correspond to biodiversity 
saturation in which new taxa could 
become established only by driving 
others to extinction. Key evidence is 
that both origination and extinction 
rates appear to have been density- 
dependent (5—7), limiting rises in di- 
versity and promoting rapid recovery 
after extinction events. 

Alternative models for global di- 
versification are expansionist, allowing global 
species diversity to rise, with damping, but with- 
out a predictable limit ($-//). Density depen- 
dence of origination and extinction rates does not 
preclude expansionist models because they may 
be dampened by limiting factors such as shortage 
of food or space, or active predation, as well as by 
climate and other physical factors. Further, it 
seems that the coupled logistic model may be 
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Fig. 2. Patterns of marine animal genus diversification through the past 
530 My, the Phanerozoic. The two lines compare current estimates from 
the empirical (uncorrected) Sepkoski database (red line) and sampling- 
standardized (corrected) analysis of the Paleobiology Database (blue line). 
The empirical curve (red line) suggests that global marine diversity reached 
a possible plateau through the Paleozoic (450 to 250 Ma) and has risen, 
apparently exponentially, ever since. The sampling-standardized curve 
(blue line) suggests that global marine diversity reached near-modern 
levels some 400 Ma and there has been only modest increase since then. 
Cm, Cambrian; C, Carboniferous; D, Devonian; J, Jurassic; K, Cretaceous; 
Ng, Neogene; O, Ordovician; P, Permian; Pg, Paleogene; S, Silurian; Tr, 
Triassic. Based on (6). 


partly an artifact of taxonomic scale (Fig. 2, red 
curve); it was worked out at ordinal and familial 
levels but does not work convincingly at generic 
or specific levels (0, 1/), there are problems with 
key numerical assumptions (//, /2), and the back- 
ground assumption of a global carrying capacity is 
doubtful (8, 70, 11). Further, it has proved hard to 
export the logistic model to the much more 
speciose terrestrial realm, whether one considers 
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plants, insects, or vertebrates, because 
these groups seem to have radiated 
explosively, without diversity plateaus, 
particularly in the past 100 million 
years (My) (10). 

Resolution between the equilibri- 
um models, and between these and 
expansionist models, might seem 
straightforward, but the solution de- 
pends on adequate assessment of the 
quality of the fossil record. The long- 
term saturation model for global diver- 
sification (Fig. 2, blue curve) arises 
from extensive attempts to correct data 
sets for sampling error (6, 7), where- 
as the multiple-equilibria and expan- 
sion models originally used raw data, 
without correction (5, 8—//), although 
more recent analyses return a some- 
what dampened but congruent signal 
when corrections are imposed. Cor- 
rection for sampling is clearly es- 
sential (6, 7), and future investigation 
must determine appropriate indepen- 
dent proxies for preservation and hu- 
man etror; some current proxies (such 
as number of fossiliferous localities) 
are themselves dependent on diversity, 
and other correction regimes may be so complex 
as to produce data in which geologic and biologic 
signals are not obviously separated. 

Life on land today may be as much as 25 
times as diverse as life in the sea, so it may be 
wrong to generalize from marine paleontolog- 
ical studies to all life. Perhaps land and sea 
show similar patterns of exponential increase in 
species numbers (8, 9, //), or perhaps they 
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Table 1. Macroevolutionary phenomena and their support for either the Red Queen (biotic, intrinsic) or Court Jester (physical, extrinsic) models. Many 
could fit either worldview, and so are noted as “multilevel mixed.” 


Red Queen 


Court Jester 


Multilevel mixed 


Interspecific competition 


Character displacement 


Evolutionary arms races (2) 


Constancy of ecological 
guilds through time (25) 
Incumbency advantage 
(3, 24) 


Waxing and waning of clades in association with 
tectonic and oceanographic events (2, 17) 

Mass extinctions and smaller extinction events 
triggered by extrinsic causes such as eruptions, 
climate change, anoxia, impact (10, 11) 

Coordinated turnovers, originations, and extinctions 
in response to physical perturbations— termed 
“coordinated stasis” or “turnover pulse” hypothesis 
(2, 29, 30) 

Nonconstant probability of extinction (3, 11) 


Lack of evidence for a global carrying capacity and 
equilibrium levels (8, 10) 

Lack of cohesiveness of the great “evolutionary 
faunas” (12) 

Species richness—energy relationship (18, 19) 


Inverse relationship between global temperature and 
biodiversity (21) 

Lack of clear correlation of species richness with 
body size or other biotic factors (16) 


Vicariance and dispersal in major phylogenetic splits (17) 


Latitudinal diversity gradient (22-24) 


Occupation of new ecospace (25) 


Subdivision of niches/specialization (10, 25) 
Declining global extinction rates through time (2, 5) 
Onshore-offshore patterns and disturbance (3) 


Resource use: stenotopes are more speciose than 
eurytopes (29, 30) 
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differ in their key rules (/3, /4), with the sea 
acting as a giant Gaussian petri dish, where spe- 
cies diversity is equilibrial and density-dependent, 
and the land witnessing continuing (dampened) 
exponential rise in diversity as ever new sectors 
of ecospace are conquered (9, 14). Any model for 
global diversification must encompass the inde- 
pendent evidence for increasing complexity of 
organisms, increases in the occupation of novel 
ecospace, explosive evolution within particular 
clades, and addition of novel clades without the 
loss of precursors (9, /7, 15), all of which have 
happened many times in the past 500 My. 


Large-Scale Controls on Species Diversity 


Taxic paleobiological studies have provided a 
great deal of evidence about controls, mainly 
abiotic, on species diversity. Biotic factors, such 
as body size, diet, colonizing ability or ecological 
specialization, appear to have little effect on the 
diversity of modern organisms, although abun- 
dance and r-selected life-history characteristics 
(short gestation period, large litter size, and short 
interbirth intervals) sometimes correlate with high 
species richness (/6). 

Geographic and tectonic history has generated 
patterns of species diversity through time. The slow 
dance of the continents as Pangaea broke up during 
the past 200 My has affected modern distribution 
patterns. Unique terrestrial faunas and _ floras, 
notably those of Australia and South America, 
arose because those continents were islands for 
much of the past 100 My. Further, major geologic 
events such as the formation of the Isthmus of 
Panama have permitted the dispersal of terrestrial 
organisms and have split the distributions of marine 
organisms. A classic example of vicariance is the 
fundamental division of placental mammals into 
three clades, Edentata in South America, Afrotheria 
in Africa, and Boreoeutheria in the northern 
hemisphere, presumably triggered by the split 
of those continents 100 Ma (/7). Other splits in 
species trees may relate to dispersal events, or 
there may be no geographic component at all. 

Species richness through time may corre- 
late with energy. The species richness—energy 
relationship (/8) posits correlations with evapo- 
transpiration, temperature, or productivity, and 
studies of terrestrial and marine ecosystems have 
shown that these factors may explain as much as 
90% of current diversity, although relationships 
between species diversity and productivity change 
with spatial scale (/9). Over long time spans, 
there are strong correlations between plankton 
morphology and diversity and water temperature: 
Cooling sea temperatures through the past 70 My, 
and consequent increasing ocean stratification, 
drove a major radiation of Foraminifera, asso- 
ciated with increasing body size (20). More 
widely, there is close tracking between temper- 
ature and biodiversity on the global scale for both 
marine and terrestrial organisms (2/), where 
generic and familial richness were relatively 


low during warm “greenhouse” phases of Earth 
history, coinciding with relatively high origina- 
tion and extinction rates. 

A much-studied manifestation of energy and 
temperature gradients is the latitudinal diversity 
gradient (LDG), namely the greater diversity of 
life in the tropics than in temperate or polar re- 
gions, both on land and in the sea. There are two 
explanations (22): (i) the time and area hypoth- 
esis, that the tropical belt is older and larger than 
temperate and polar zones, and so tropical clades 
have had longer to speciate, or (ii) the diversifi- 
cation rate hypothesis, that there are higher rates 
of speciation and lower rates of extinction in the 
tropics than elsewhere. There is geological and 
paleontological evidence for a mixture of both 
hypotheses (23, 24). 

Species diversity may increase by the occu- 
pation of new ecospace. The number of occupied 
guilds, that is, broad ecological groupings of 
organisms with shared habits, has increased in 
several steps through time, from 20 in the early 
Paleozoic to 62 in post-Paleozoic marine faunas 
(25). Further, marine animals have shown several 
step increases in tiering, the ability to occupy and 
exploit different levels in the habitat: At times, 
burrowers have burrowed deeper, and reef- 
builders have built taller and more complex reefs. 
Analogous, if even more dramatic, expansions of 
ecospace have occurred on land, with numerous 
stepwise additions of new habitats, from the 
water-margin plants and arthropods of the early 
Paleozoic to the forests and upland habitats of the 
later Paleozoic when land animals first burrowed, 
climbed, and flew, through the introduction of 
herbivory, giant size, endothermy, and intelli- 
gence among vertebrates, and the great blossom- 
ing of flowering plants (with associated vast 
expansions in diversity of plant-eating and social 
insects and modern vertebrates) during the Cre- 
taceous Terrestrial Revolution 100 Ma (26). 

The other mode of species increase globally 
or regionally is by niche subdivision, or increas- 
ing specialization. This is hard to document be- 
cause of the number of other factors that vary 
between ecosystems through time. However, mean 
species number in communities (alpha diversity) 
has increased through time in both marine (/5, 25) 
and terrestrial (10) systems, even though niche 
subdivision may be less important than occupa- 
tion of new ecospace in increasing biodiversity. 
Further, morphological complexity may be quan- 
tified, and a comparative study of crustaceans 
shows, for example, that complexity has increased 
many times in parallel in separate lineages (27). 


Phylogenetic Studies of Clade Histories 

Species are not randomly distributed; they have 
an evolutionary history, and so occur as twigs on 
a great phylogenetic tree. Studying species as 
members of clades is a fruitful approach to un- 
derstanding the drivers and controls on specia- 
tion. Key questions include (i) Do species 


diversify early in a clade’s history? (ii) How do 
diversity and disparity (variance in characters or 
morphology) covary? (iii) Do major lineages 
within a clade follow similar, or different, pat- 
terns, and if different, why? (iv) Do evolutionary 
radiations follow the acquisition of new charac- 
ters or emptying of ecospace? (v) How do major 
clades of apparent competitors interact over long 
spans of geologic time? and (vi) How do sister 
clades vary in species diversity and why? 

For such analyses, the ideal is a complete 
species tree, a phylogenetic tree that contains 
all species living and extinct, plotted accurately 
against geologic time (4). Simple to say; hard to 
achieve. More commonly, incomplete trees have 
been used, with the risk of error in calculations of 
evolutionary rates or comparisons of subclades. 
In paleontology, it has proven much easier to 
work with higher taxa such as genera or families 
because species fossil records are less complete 
than those of higher taxa, and yet it is not clear 
how higher-level patterns relate to those at spe- 
cies level. Many key questions can be tackled by 
comparing a real tree to a hypothetical tree that 
follows an equal-rate Markov (ERM) model, 
equivalent to tree growth after a random walk, 
where equal chances of speciation and of ex- 
tinction are shared by all species (4). 

Major biotic replacements, where one clade 
replaces another, have been a focus of debate 
about the roles of competition and progress in 
macroevolution, and dinosaurs provide a classic 
example. The standard view was that dinosaurs 
originated in the Late Triassic, some 230 Ma, by a 
process of competition in which they prevailed 
over their precursors, the crocodile-like crurotar- 
sans and others, because of superior adaptations. 
A comparative phylogenetic study (28) shows, 
however (Fig. 3), that the Dinosauria expanded in 
two steps, one after an extinction event 225 Ma 
that removed dominant herbivores, and the second 
following the end-Triassic extinction 200 Ma that 
removed most of the crurotarsans. Dinosauria 
remained at moderate diversity and low disparity, 
and at lower disparity than the crurotarsans they 
supposedly out competed, during the 25 My be- 
tween the events, suggesting that there was no 
insistent competition driving other groups to 
extinction but rather that the dinosaurs occupied 
new ecospace opportunistically, after it had been 
vacated. 

A further study on Dinosauria explored the 
subsequent evolution of the clade (26). Classic 
views that the dinosaurs arose with a flourish, 
and then finally gave way in the Cretaceous to the 
superior mammals, or that they dwindled to ex- 
tinction because of “racial senility,” had long 
been abandoned. The dinosaurs seemed to be 
radiating actively in the Cretaceous, with many 
new clades appearing through their last 55 My, 
and especially in their final 15 My. The new 
study (26) shows that most diversification shifts 
(departures from ERM assumptions) fall in the 
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first one-third of the history of the clade and that 
their continuing diversification in the Late Jurassic 
and Cretaceous was mainly indistinguishable from 
a random walk. In particular, dinosaurs did not 
participate in the Cretaceous Terrestrial Revolu- 
tion, some 130 to 100 Ma, when flowering plants, 
leaf-eating insects, social insects, squamates, and 
many other modem groups radiated substantially. 


245.9 237 232.5 228 


There is no geometric reason that diversification 
shifts should mainly occur low in a clade’s history: 
Clade shapes vary from bottom-heavy to top- 
heavy, and diversification shifts may be concen- 
trated low (dinosaurs and bats) or high (insects 
and ants) in a clade (26). 

In the future, the identification of diversification 
shifts across numerous taxa may provide evidence 


203.6 199.6. 


Anisian 


Lad | Crn 


Dinosauria 


Crurotarsi 


0.12 
0.06 
N 
2 0) 
oO 
£ 
Z -0.06 
° 
fe) 
oO 
Oe 
2 : 
2 018) = 
-0.24 
-0.3 


Crurotarsan morphospace 


Norian 
i] 
[1 Dinosauromorphs” 
ORNITHISCHIA 
SAUROPODOMORPHA 
THEROPODA 
PHYTOSAURIA 
AETOSAURIA 
CROCODYLOMORPHA 
“Rauisuchids” 


POPOSAUROIDEA 


ORNITHOSUCHIDAE 


Rh.) EJ 


ee _ © 
Dinosaur morphospace - 


> Pterosaur morphospace 


-0.24 


-0.16 -0.08 


0.08 0.16 0.24 0.32 


Principal coordinate 1 


Fig. 3. Phylogenetic relationships and morphospace occupation for Triassic archosaurs. (A) Framework 
phylogeny for Triassic crurotarsans scaled to the Triassic time scale. Numbers at top refer to millions of years 
before the present; gray bars represent the observed durations of major lineages; vertical dashed lines denote 
two extinction events, at the Carnian-Norian and Triassic-Jurassic boundaries; arrowheads indicate lineages that 
survived the latter event. Lad, Ladinian; Crn, Carnian; Rh, Rhaetian; EJ, Early Jurassic. (B) Empirical morphospace 
for Late Triassic archosaurs, based on the first two principal coordinates. Large circles, dinosaurs; ovals, 
pterosaurs; squares, poposauroids; hexagons, phytosaurs; stars, aetosaurs; crosses, crocodylomorphs; smaller 
black dots, “rauisuchids”; larger black dots, nondinosaurian dinosauromorphs, Scleromochlus. Based on (28). 
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for the relative importance of the Red Queen and 
Court Jester worldviews. If the majority of diver- 
sification shifts are coordinated, and associated 
with particular climatic, tectonic, and geographic 
drivers, then the Court Jester model of macro- 
evolution would prevail. This would link most 
increases in species diversity to particular large- 
scale radiation events, such as the Cretaceous Ter- 
restrial Revolution (26), or recoveries after mass 
extinctions. If, on the other hand, the majority of 
diversification shifts are unique to particular clades, 
and not coordinated temporally with others, then 
the Red Queen worldview might be considered. 


Comparing Sister Taxa 


A powerful element of the comparative phyloge- 
netic approach to species diversity through time 
is the opportunity to compare sister taxa. Sisters 
arose from a single ancestor, and so their tra- 
jectories occupy the same amount of time, and 
they started with the same genotype and phe- 
notype. Any similarities in their subsequent evo- 
lution probably reflect this phylogenetic signal of 
a common origin, but differences reflect inde- 
pendent aspects of their separate histories. 
Comparisons of sister taxa have allowed tests 
of the resource-use hypothesis (29), that general- 
ists are less speciose and have longer species 
durations than specialists. Specialists divide the 
physical environment into small patches, each 
occupied by a species, and each probably more 
subject to environmental crises than their gener- 
alist relatives. Classic examples in support of the 
resource-use hypothesis come from studies of Ne- 
ogene mammals (29). For example, two antelope 
subgroups, the tribes Alcelaphini and Aepycerotini, 
diverged 6 to 8 Ma. The former is now highly 
speciose, with some 7 living and 25 extinct spe- 
cies, and the latter is represented by two species, 
only one, the impala Aepyceros, surviving. The 
slowly evolving Aepycerotini consists of few 
species at any time, and each of those is long 
lived, whereas the speciose Alcelaphini consists 
of many short-lived species. The ecological 
habits of both clades differ: The impala has a 
broad, generalist diet, whereas the speciose al- 
celaphines show more dietary specialization. In 
wider studies of many clades of Neogene African 
and South American mammals (30), the resource- 
use hypothesis was supported, and some subsidiary 
predictions confirmed: Specialists are more com- 
mon than generalists, carnivores include more 
generalists than herbivores, and there are more 
specialists in habitats that underwent recent en- 
vironmental change (tropical rain forests and 
deserts). The resource-use model then stresses 
the role of climate and tectonic movements in 
determining species diversity rather than biological 
controls such as competition and predation. 


Outlook 


Paleontologists and evolutionary ecologists have 
debated species diversity largely independently. 
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The realization that the Red Queen and Court 
Jester models may be scale-dependent, and that 
evolution may be pluralistic (3), opens oppor- 
tunities for dialog. Taxic studies in paleontology 
continue to have great value in highlighting cor- 
relations between species richness and other fac- 
tors, but comparative phylogenetic methods will 
illuminate questions about clade dynamics, spe- 
cies richness, and the origin of novelties. Further, 
methods are shared by paleontologists and ne- 
ontologists, and this allows direct communication 
on the patterns and processes of macroevolution. 
Viewed close up, evolution is all about biotic 
interactions in ecosystems (Red Queen model), 
but from further away, the large patterns of bio- 
diversity are driven by the physical environment 
(Court Jester model). 
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REVIEW 


Adaptive Radiation: 


Contrasting Theory with Data 


Sergey Gavrilets?* and Jonathan B. Losos” 


Biologists have long been fascinated by the exceptionally high diversity displayed by some 
evolutionary groups. Adaptive radiation in such clades is not only spectacular, but is also an 
extremely complex process influenced by a variety of ecological, genetic, and developmental 
factors and strongly dependent on historical contingencies. Using modeling approaches, we 
identify 10 general patterns concerning the temporal, spatial, and genetic/morphological 
properties of adaptive radiation. Some of these are strongly supported by empirical work, whereas 
for others, empirical support is more tentative. In almost all cases, more data are needed. Future 
progress in our understanding of adaptive radiation will be most successful if theoretical and 
empirical approaches are integrated, as has happened in other areas of evolutionary biology. 


that Darwin sought to explain in On the 

Origin of Species emerged through a va- 
riety of intricate biological processes. One of 
these is adaptive radiation, which some consider 
of foremost importance and potentially responsi- 
ble for much of the ecological and phenotypic 
diversity of life (/, 2). “Adaptive radiation” refers 
to those evolutionary groups that have exhibited 
an exceptional extent of adaptive diversification 
into a variety of ecological niches (2-4), with 
such divergence often occurring extremely rap- 
idly (5). Classic examples of adaptive radiation 


Te spectacular diversity of life on Earth 
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include Darwin’s finches on the Galapagos is- 
lands, Anolis lizards on Caribbean islands (Fig. 1), 
Hawaiian silverswords, and cichlids of the East 
African Great Lakes (Fig. 2), among many others 
C/, 2, 6). 

Adaptive radiation has two components: the 
production of new species (speciation) and the 
adaptation of constituent species to a diversity of 
ecological niches. Although many classic adapt- 
ive radiations are both species rich and adaptively 
disparate (7), this correlation is far from perfect: 
Some adaptive radiations have relatively low spe- 
cies richness (e.g., Darwin’s finches, Australian 
pygopodid lizards); alternatively, some speciose 
clades contain little adaptive disparity and thus 
would not qualify as adaptive radiations (3, 8). 

The classic view of adaptive radiation focuses 
on ecological opportunity, in which an ancestral 
species finds itself in an environment in which 
resources are abundant and underutilized. Such 
resource availability may result from coloniza- 


tion of an underpopulated area (e. g., island or 
lake), extinction of previously ecologically dom- 
inant groups, or evolution of a character— 
sometimes termed a “key innovation’”—that 
allows the lineage to interact with the environ- 
ment in novel ways (/, 2). Different evolutionary 
factors allowing the populations to take advan- 
tage of new ecological opportunity have been em- 
phasized, including genetic drift in small founder 
or peripheral populations (9), relaxed (9) or strong 
selection (2, /0), and hybridization (1/, 2). 


Empirical Approaches for Studying 
Adaptive Radiation 


Four main empirical approaches have been used: 
Fossils. Methods based on fossil data allow 
one to infer the history of the clade through time 
and to use information from extinct taxa. The 
disadvantages of this approach are incomplete- 
ness of the fossil record, difficulty in assessing the 
adaptive significance of phenotypic variation among 
taxa, and the absence of ecological, behavioral, 
physiological, and other types of data. 
Phylogenetic comparative methods. Phylo- 
genetic approaches take advantage of increas- 
ingly complete phylogenies for many important 
groups and have the ability to integrate studies of 
the evolution of organismal function and ecology. 
The main disadvantage of these methods is that 
extinct taxa are often not represented so that there 
is no way, for example, to detect whether a clade 
has been more species-rich in the past. Moreover, 
phylogenetic inferences about character states in 
the past can be unreliable (73, 14). 
Microevolutionary studies of extant taxa. 
Studies focusing on traits of and processes affect- 
ing extant taxa—e.g., phenotypic characters, eco- 
logical niches, spatial structure, genetics, local 
adaptation, competition, and sexual selection— 
can elucidate much about the processes driving 
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Fig. 1. Adaptive diversification in Caribbean Anolis lizards. On each island in the Greater Antilles, anoles have 
diversified to produce the same set of habitat specialists. Shown here (clockwise from top left) are A. cybotes, 
trunk-ground specialist, Hispaniola (photo: J. Losos); A. pulchellus, grass-bush, Puerto Rico (photo: J. Losos); 
A. insolitus, twig, Hispaniola (photo: L. Mahler); A. chlorocyanus, trunk-crown, Hispaniola (photo: M. Losos). 


adaptive radiation and sometimes even include 
manipulative experiments. However, extrapola- 
tion from processes operating today to what hap- 
pened early in the history of a radiation is 
problematic; in the past, different processes may 
have operated or the outcome of these processes 
may have been different. These differences might 
result because conditions may have been differ- 
ent early in a radiation’s history, including greater 
resource abundance, less ecological specializa- 
tion, and less genetic canalization. Alternatively, 
one can study taxa that may be in early stages of 
radiation (e.g., stickleback fishes), but such eco- 
logically and morphologically nondisparate groups 
are not necessarily good models for the early stages 
of adaptive radiation. 

Adaptive radiation in the laboratory. The ad- 
vantage of studies of adaptive radiation in mi- 
crobial microcosms [e.g., (J5, /6)] and digital 
cyberworlds (/7) is experimental flexibility. How- 
ever, it is not clear whether these systems are 
good analogs to adaptive radiations in nature. 


Mathematical Modeling 


Adaptive radiation can also be studied by using 
mathematical methods. The latter have tradition- 
ally played a major role both in evolutionary 
biology and in ecology. For example, the modern 
synthesis of the 1930s and 1940s was a direct 
result of the development of theoretical popula- 
tion genetics by Fisher, Wright, and Haldane 
(8). A quantitative theory of speciation, which 


has emerged over the past 40 years [reviewed in 
(19-21)], has clarified many questions and topics 
hotly debated by generations of biologists. The 
main advantages of mathematical modeling are 
its generality, the ability to identify crucial pa- 
rameters, factors, and relevant temporal and spatial 
scales, as well as to point to the gaps in biological 
knowledge and intuition. However, the biological 
realism of models and their underlying assump- 
tions can always be questioned. 

Our goal here is threefold. First, we summa- 
rize recent theoretical findings on the dynamics 
of adaptive radiation. Second, we attempt to test 
theoretical predictions against existing data ob- 
tained using a variety of empirical approaches. 
Third, we identify important areas of theoretical 
and empirical research where notable advances 
can soon occur and can contribute dramatically to 
improving our understanding of adaptive radia- 
tion and the generation of biodiversity. 


Dynamic Models of Adaptive 
Radiation—Extensions on Speciation Theory 


Adaptive radiation can be viewed as the pro- 
cesses of speciation and adaptation extended to 
larger spatial and temporal scales. In classifying 
mechanisms of speciation (and, by extension, 
adaptive radiation), different approaches are pos- 
sible. Most commonly, mechanisms of speciation 
are discussed according to the level of migration 
between the diverging (sub)populations (9, 22). 
In this classification, the basic modes of specia- 


tion are allopatric, parapatric, and sympatric, cor- 
responding to zero, intermediate, and maximum 
migration, respectively (/9-2/). Alternatively, 
mechanisms of speciation can be classified accord- 
ing to biological mechanisms driving divergence 
and speciation, e.g., random drift, ecological se- 
lection, or sexual selection (23, 24). 

However, sometimes very different biological 
mechanisms have underlying dynamic common- 
alities and therefore can be described by very sim- 
ilar mathematical models. Therefore, classifying 
mechanisms of speciation and adaptive radiation 
on the basis of similarity of the corresponding 
models can be more general and insightful than 
on the basis of particular biological factors. 

Five partially overlapping sets of models can 
be identified. In “spontaneous clusterization” 
models, an initially random mating population 
accumulates a substantial amount of genetic var- 
iation by mutation, recombination, and random 
drift and then splits into partially or completely 
reproductively isolated groups [reviewed in (20)]. 
Spontaneous clusterization models describe the 
accumulation of Dobzhansky-Muller genetic in- 
compatibilities, speciation by hybridization, di- 
vergence in mating preferences, or allochronic 
speciation (i.e., speciation via divergence in the 
timing of life-cycle events related to reproduc- 
tion). In “invasion of empty niches” models [e.g., 
(25, 26)], a few founders enter a new environment 
with a number of novel discrete ecological niches. 
As selection acts on the new genetic variation 
supplied by mutation, different lineages become 
adapted to and simultaneously develop genetic 
preferences for different ecological niches. Eco- 
logical and phenotypic diversification is accom- 
panied by the growth in the densities of emerging 
species (Fig. 3). The process of local adaptation is 
opposed by deleterious gene flow from subpop- 
ulations adapting to alternative niches. Under 
these conditions, a reinforcement-like process can 
result in the evolution of premating reproductive 
isolation between different locally adapted groups. 
“Selection gradient” models [e.g., (27, 28)] are 
similar to “invasion of empty niches” models ex- 
cept that environmental conditions vary in a con- 
tinuous gradient-like fashion and selection for 
local adaptation acts on a single quantitative trait, 
the optimum value of which changes linearly ac- 
ross space. In “sympatric diversification” models 
[e.g., (29, 30)], a spatial component is not present, 
so that these models describe sympatric speciation. 
In these models, emerging species specialize on 
relatively narrow parts of a continuous unimodal 
distribution of abundances of a particular set of re- 
sources. In the fifth set of models, diversification is 
driven by coevolutionary interactions [e.g., (37)]. 


Ten Patterns of Adaptive Radiation 


In mathematical modeling, details do matter and 
generalizations are often difficult to make. But 
some general patterns do emerge in models of 
speciation and adaptive radiation. Here we re- 
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view and evaluate with empirical data 10 pat- 
terns concerning temporal, spatial, and genetic/ 
morphological properties of adaptive radiation. 
In some cases, mathematical models and general- 
izations from data have suggested the same pat- 
terns independently. 

1) Early burst of evolutionary divergence: 
Typically, there is a burst of speciation and 
morphological diversification soon after the be- 
ginning of the radiation rather than similar rates 
through time. 

Mathematical models of adaptive radiation 
that explicitly describe microevolutionary pro- 
cesses predict a burst of speciation soon after 
colonization (25, 26). This happens because the 
rate of speciation declines as fewer niches remain 
empty (reduced ecological opportunity) and spe- 
cies become more specialized (increased genetic 
constraints). In addition, phenomenological null 
models of clade diversification (32, 33) predict 
that morphological disparity grows more rapidly 
than species diversity early in the clade history 
due to the structure of multidimensional pheno- 
type space and contrasting effects of speciation 
and extinction on disparity and diversity. 

Both paleontological and phylogenetic com- 
parative data indicate that ecologically important 
morphological variation often arises early in a 
clade’s history. The radiation of placental mam- 
mals is a classic example, with most modern or- 
ders appearing in the fossil record within a short 
period (34). Many fossil groups experience a peak 
in morphological disparity relatively early in their 
history and subsequently decline (35). Phyloge- 
netic studies also indicate that major ecological 
differences often evolve early in clade history, 
such as tree versus ground Darwin’s finches or the 
“ecomorph” habitat specialists of Anolis (12, 36). 

An increasingly common theme in molecular 
studies of speciation is the finding of a burst of 
species diversification early in a clade’s history 
(37, 38). This pattern is also seen in the fossil 
records for some groups. This finding is often dis- 
cussed in the context of adaptive radiation as evi- 
dence for the early burst model of diversification. 
However, implicit in this discussion is the assump- 
tion that species diversity is correlated with adapt- 
ive diversity, an assumption that probably is often, 
but not always, correct. Coupling of studies of 
species diversification and the evolution of adapt- 
ive disparity is an important new direction (39). 

2) Overshooting: An early increase in species 
diversity is followed by a decline that plateaus (or 
substantially decelerates). 

Overshooting can result if speciation rate is 
decreasing and/or extinction rate is increasing in 
time. The rate of speciation can decrease because 
fewer niches remain empty (reduced ecological 
opportunity) and species become more special- 
ized (increased genetic constraints). Mathemati- 
cal models provide only partial support because 
overshooting is predicted only under some con- 
ditions (25). 


There are several ways to examine these ideas. 
The most direct is to follow a radiation through 
history. Some clades do, in fact, exhibit an early 
peak in species richness, followed by a decline and 
then leveling-off of diversity (40); however, al- 
ternative interpretations exist to explain these pat- 
terns (4/). 

An alternative approach is to compare related 
clades of different ages, with the assumption that 
all else is equal [but see (42)]. Taking this ap- 
proach, overshooting has also been demonstrated 
in a comparison of the Zetragnatha spiders of the 
Hawaiian islands, in which the relatively young 
(but not youngest) island of Maui has the most 
species; the high diversity is attained by having 
several niches occupied by sets of allopatric spe- 
cies. Gillespie (43) hypothesized that in time, the 
ranges of such species expand and come into 


contact, at which point competitive exclusion oc- 
curs, leading to a decrease in species richness on 
older islands. Among African Rift lakes, the de- 
pendence of cichlid species diversity on time, 
once corrected for lake surface area, appears to be 
L-shaped [figure 2d in (38)], as expected under 
overshooting. 

3) Stages of radiation: All else being equal, the 
following sequence of the diversification events is 
expected: (i) divergence with respect to macro- 
habitat; (ii) evolution of microhabitat choice and 
divergence with respect to microhabitat; (iii) di- 
vergence with respect to traits that simultaneously 
control the degree of local adaptation and nonran- 
dom mating; and (iv) divergence with respect to 
other traits controlling survival and reproduction. 

This hypothesis is mostly based on general- 
izations from the mathematical theory of speci- 


Fig. 2. Adaptive diversification in cichlids. Several examples of cichlids that use different habitats. (Upper 
left) Gnathochromis permaxillaris occurs in the deeper (>35 m) intermediate (rock-sand interface) habitat 
with sediment-rich bottoms in Lake Tanganyika. (Upper right) Lethrinops furcifer is found in sandy 
environments near beaches in Lake Malawi. (Center) Hemitilapia oxyrhynchus, which is found in shallow 
vegetated habitats in Lake Malawi. (Lower left) Cyprichromis sp. “Leptosoma Jumbo” (Nkondwe) occurs in 
open water in Lake Tanganyika. (Lower right) Petrotilapia nigra occurs in the upper part (>10 m) of the 
sediment-free rocky habitat of Lake Malawi. [Photographs by A. Konings] 
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ation (20). Only a limited amount of theoretical 
work has so far been done with models de- 
scribing the evolution of various types of traits 
for a sufficiently long time. But in these cases, 
simulations did support the above hypothesis 
(25, 26). However, many factors can interfere 
with the order in which niche axes are partitioned 
during species divergence, such as strength of 
selection, available genetic variation, extent of 
environmental heterogeneity, or historical con- 
tingencies such as the ancestral starting condition 
of a clade (2, 20). 

Stages hypotheses have been proposed for 
mountain New Guinea birds (divergence in hab- 
itat elevation first, followed by segregation in diet, 
foraging techniques, and size), Phylloscopus leaf 
warblers (order of divergence: body 
size, foraging morphology and be- 
havior, habitat), cichlids and parrot- 
fish (habitat, diet, sexually selected 
traits), and Anolis lizards (body size, 
structural microhabitat, microclimate) 
[reviewed in (44, 45)]. However, 
these hypotheses have been proposed 
on the basis of phylogenetic analyses 
that may be incapable of accurately 
identifying character states during 
early stages of a radiation for traits 
that are evolutionarily labile (45). 

4) Area effects: Speciation driv- 
en by ecological factors is promoted 
by larger geographic areas (e.g., of 
islands or lakes). 

Both mathematical models (20, 25) 
and verbal theory show that several 
factors might contribute to increas- 
ing speciation rates with area. First, 
larger areas imply that larger popu- 
lation sizes can be achieved, lead- 
ing to increased selection efficiency 
and more advantageous mutations 
for selection to act upon. Second, 
the environment may be more het- 
erogeneous in larger areas, thus con- 
taining more ecological niches. For 
example, a metric of niche availa- 
bility better explains the extent of 
species diversity in Galapagos snails 
than does island area (46). Third, 
larger areas may allow for more 
opportunity for allopatric or para- 
patric speciation (25, 47). In all 
three cases in which this hypothesis 
has been examined (Caribbean Anolis lizards, 
Galapagos snails, and African cichlids), the area 
effect results in part because a threshold island/ 
lake area exists below which in situ cladogenesis 
does not occur, even though some islands below 
the threshold are environmentally quite hetero- 
geneous (38, 46, 48). 

5) Nonallopatric diversification: Speciation 
during adaptive radiation can occur in the ab- 
sence of allopatry. 


In the traditional view [e.g., (9, 49)], divergent 
adaptive evolution occurred in allopatric popula- 
tions that adapted to different ecological niches 
and thus incidentally evolved reproductive iso- 
lation. Subsequent sympatry could lead to rein- 
forcement of incomplete reproductive isolation 
and enhancement of ecological differences (char- 
acter displacement). 

Recent years have seen a resurgence of inter- 
est in nonallopatric modes of speciation driven by 
divergent natural selection pressures (/9, 20, 50). 
In line with these theories, some see adaptive ra- 
diation as the expected outcome of sympatric or 
parapatric speciation in the context of abundant 
ecological opportunity. Mathematical models have 
identified a number of conditions promoting non- 


Fig. 3. An illustration of an adaptive radiation in a mathematical model (25, 26). 
Each square represents a unit spatial area (patch). The color of the square 
defines the ecological niche assigned to the patch. Each local population is 
represented by a circle. The radius of the circle is proportional to the pop- 
ulation size. The color of the circle defines the niche preferred by most 
individuals. Matching of the color of the corresponding square and circle 
(observed in most cases) means that most individuals in the patch prefer the 
ecological conditions they experience. In the case shown, there are eight 
different groups of species utilizing eight different ecological niches. Each of 
these groups is composed of two or three different species differentiated by 
mating preferences. 


allopatric speciation, including strong disruptive 
selection, strong nonrandom mating, high levels 
of genetic variation, close correlation of traits ex- 
periencing disruptive selection with those control- 
ling nonrandom mating, and absence of costs of 
being choosy in mating (/9, 50). How common 
these conditions are in nature is controversial 
(20, 21). 

The strongest evidence for nonallopatric adapt- 
ive radiation comes from examples of diversifica- 
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tion in areas in which it is hard to envision an 
allopatric phase. The classic case is the clade of 11 
cichlid species found in a small and environmen- 
tally homogeneous crater lake in Cameroon 
(51). At the same time, the lack of speciation in 
some groups on relatively large and environmen- 
tally heterogeneous, but isolated, islands suggests 
that geographic isolation may be needed for 
speciation to occur, as discussed above. 

6) Selection gradient effect: Selection gra- 
dients of intermediate slopes promote speciation. 

Models of parapatric speciation in which 
spatially heterogeneous selection for local adap- 
tation acts on an additive trait (27, 28) show that 
if selection gradients are too shallow, differences 
in selection experienced in different parts of the 
species range are not strong enough 
to drive divergence and speciation. 
If selection gradients are too steep, 
selection is so strong that it prevents 
the population’s spread into unoc- 
cupied areas with different selec- 
tion. As a result, intermediate slopes 
of selection gradients are most con- 
ducive for speciation. This predic- 
tion is supported by a recent study of 
divergence in male coloration and 
female preferences in cichlids (52). 

7) Spatial dimensionality effect: 
All else being equal, geographic 
areas that can approximately be 
viewed as one-dimensional (such as 
rivers or shores of lakes and oceans) 
promote more speciation and tend to 
maintain higher species richness and 
phenotypic and genetic diversity per 
unit area than geographic areas that 
are two-dimensional (such as lakes, 
oceans, and continental areas). 

This prediction follows both from 
classical results on patterns of geo- 
graphic variation in neutral loci in- 
duced by isolation by distance [e.g., 
(53)] and from speciation models 
(20). This spatial dimensionality 
effect may have contributed to the 
extraordinary divergence of cichlids 
in the great lakes of Africa, most 
species of which inhabit the rela- 
tively narrow band along the shore- 
line and have extremely restricted 
dispersal abilities. However, it does 
not account for the low diversity of 
cichlids in African rivers (54). In addition, radi- 
ation of other lacustrine taxa is not limited to di- 
versification in shoreline habitats (55). Moreover, 
in the oceans, some types of speciation and evo- 
lutionary diversification occur more readily in 
open, two-dimensional habitats than along con- 
tinental shelves (56). 

8) Least action effect: Speciation occurring 
after the initial burst usually involves a minimum 
phenotypic change. 
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This pattern is explained by the joint action of 
genetic constraints and selection for local adap- 
tation: As each lineage gets more specialized for 
a particular ecological niche, it becomes more 
and more difficult to establish adaptations for al- 
ternative niches (25, 26). The widely observed 
pattern of phylogenetic niche conservatism, in 
which closely related species are ecologically and 
phenotypically similar, is consistent with this pat- 
tern (57, 58). 

9) Effect of the number of loci: Rapid and 
extensive diversification is most likely if the 
number of underlying loci is small. 

This is one of the most general patterns emerg- 
ing from a variety of models (20, 25, 26). A 
smaller number of loci implies that their individ- 
ual contributions to the traits are larger, the effect 
of selection on each locus is stronger, the number 
of genetic changes necessary for divergence is 
smaller, and recombination and migration are less 
efficient in destroying the coadapted combina- 
tions of genes that emerge during speciation. This 
prediction is supported by data on cichlids, finches, 
and monkeyflowers, all of which indicate the ex- 
istence of a small number of loci underlying traits 
involved in ecological interactions and reproduc- 
tion (59-61). 

10) Porous genome effect: Species can stably 
maintain their divergence in a large number of 
selected loci for very long periods despite substan- 
tial hybridization and gene flow that decreases or 
removes differentiation in neutral markers. 

This effect has been observed in many 
different models (25, 26). A related prediction is 
that during earlier stages of diversification, when 
reproductive isolation is still weak, new traits 
(ecological or involved in mating) that emerge in 
some lineages can spread to other lineages as a re- 
sult of hybridization. 

Botanists have long been aware that sympatric 
and ecologically differentiated taxa can maintain 
their distinctiveness in the face of ongoing hybrid- 
ization. Recent years, however, have shown that a 
similar pattern also occurs in many species of ani- 
mals (62). Indeed, recent work has suggested that 
such interspecific gene flow can be an important 
source of genetic variation that enables adaptive 
diversification in early stages of radiation (//, /2). 

On the other hand, the homogenizing effect 
of gene flow has classically been considered an 
impediment to evolutionary divergence (9), a role 
that it probably plays in many cases (63). More- 
over, in some cases in which reproductive iso- 
lation is based on ecological differences among 
species, environmental perturbations can negate 
these isolating mechanisms, leading previously 
genetically distinct taxa to interbreed and even 
“de-speciate” (64). 


General Conclusions 
This brief survey suggests the following: 

1) More empirical studies are needed spe- 
cifically aimed at assessing the predictions 


discussed above. At present, we are left with 
a haphazard set of studies that happen to be 
relevant. 

2) More generally, evolutionary biology is an 
inductive science in which we establish general- 
ities by the accumulation of case studies. The 
number of adaptive radiations that have been 
extensively studied from the many different per- 
spectives relevant to our discussions is surpris- 
ingly small. More detailed studies, integrating 
across a variety of approaches and disciplines, is 
needed to build a reservoir of case studies from 
which generalizations can be drawn. 

3) We need studies of general models aiming 
to capture the most widespread patterns of adapt- 
ive radiation. At the same time, we need models 
that are more closely tailored to the biology of par- 
ticular taxa. For example, some models (25, 26) 
assume that mating occurs in the ecological niche 
that a species exploits; this assumption is true for 
some taxa, such as some host-specific insect 
herbivores, but not for other classic adaptive ra- 
diations. The extent to which relaxing these 
assumptions or tailoring them to other biological 
situations (e.g., different species concepts) would 
change the predictions needs to be explored. The 
complexity of the processes of adaptive radiation 
is reflected in the complexity of corresponding 
mathematical models. Using the emerging tools 
of high-performance computing will be crucial 
for better understanding of models (and nature). 

Darwin was confronted with the fruits of 
adaptive radiation throughout his 5-year journey 
around the world, and their evolutionary exuber- 
ance made an impression on him. Speaking of the 
Galapagos finches that now bear his name, he 
wrote in The Voyage of the Beagle: 

“Seeing this gradation and diversity of struc- 
ture in one small, intimately related group of birds, 
one might really fancy that from an original pau- 
city of birds in this archipelago, one species has 
been taken and modified for different ends.” 

In the 150 years since publication of the 
Origin, adaptive radiations have continued to 
astonish and inspire scientists and the public 
alike. But how exactly radiation occurs, and how 
it differs among taxa and in different settings, as 
well as why some lineages radiate and others do 
not, are still unclear. Most likely this is because 
there is no single answer: Lineages vary in mani- 
fold ways, various evolutionary factors act simul- 
taneously, similar evolutionary outcomes can be 
achieved via alternative paths, and the contingen- 
cies of place and time play a large role in guiding 
the evolutionary process. 
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REVIEW 


Evidence for Ecological Speciation 


and Its Alternative 


Dolph Schluter 


Natural selection commonly drives the origin of species, as Darwin initially claimed. Mechanisms of 
speciation by selection fall into two broad categories: ecological and mutation-order. Under 
ecological speciation, divergence is driven by divergent natural selection between environments, 
whereas under mutation-order speciation, divergence occurs when different mutations arise and 
are fixed in separate populations adapting to similar selection pressures. Tests of parallel evolution 
of reproductive isolation, trait-based assortative mating, and reproductive isolation by active 
selection have demonstrated that ecological speciation is a common means by which new species 
arise. Evidence for mutation-order speciation by natural selection is more limited and has been 
best documented by instances of reproductive isolation resulting from intragenomic conflict. 
However, we still have not identified all aspects of selection, and identifying the underlying genes 


for reproductive isolation remains challenging. 


but at last we can agree with Darwin that the 

origin of species, “that mystery of mysteries” 
(J), really does occur by means of natural selection 
(2-5). Not all species appear to evolve by 
selection, but the evidence suggests that most of 
them do. The effort leading up to this conclusion 
involved many experimental and conceptual ad- 
vances, including a revision of the notion of 
speciation itself; 80 years after publication of On 
the Origin of the Species, to a definition based on 
reproductive isolation instead of morphological 
differences (6, 7). 

The main question today is how does selec- 
tion lead to speciation? What are the mechanisms 
of natural selection, what genes are affected, and 
how do changes at these genes yield the habitat, 
behavioral, mechanical, chemical, physiological, 
and other incompatibilities that are the reproduc- 
tive barriers between new species? As a start, the 
many ways by which new species might arise by 
selection can be grouped into two broad catego- 


I took evolutionary biologists nearly 150 years, 


Biodiversity Research Centre and Zoology Department, 
University of British Columbia, Vancouver, BC V6T 124, 
Canada. E-mail: schluter@zoology.ubc.ca 


ries: ecological speciation and mutation-order 
speciation. Ecological speciation refers to the 
evolution of reproductive isolation between pop- 
ulations or subsets of a single population by ad- 
aptation to different environments or ecological 
niches (2, 8, 9). Natural selection is divergent, 
acting in contrasting directions between environ- 
ments, which drives the fixation of different 
alleles, each advantageous in one environment 
but not in the other. Following G. S. Mani and 
B. C. Clarke (/0), I define mutation-order specia- 
tion as the evolution of reproductive isolation by 
the chance occurrence and fixation of different 
alleles between populations adapting to similar 
selection pressures. Reproductive isolation evolves 
because populations fix distinct mutations that 
would nevertheless be advantageous in both of 
their environments. The relative importance of 
these two categories of mechanism for the origin 
of species in nature is unknown. 

In this review, I summarize progress in under- 
standing the general features of speciation by se- 
lection. I do not differentiate speciation by sexual 
selection here because natural selection drives the 
divergence of mate preferences, by either eco- 
logical or mutation-order mechanisms, in most 


theories of the process (8, //). I leave out dis- 
cussion of sympatric and allopatric speciation but 
instead identify the likelihood of ecological and 
mutation-order speciation when there is gene 
flow. I ignore reinforcement, a special type of 
natural selection thought to favor stronger pre- 
mating reproductive isolation once postzygotic 
isolation has evolved. I also ignore speciation by 
polyploidy, even though selection might be crucial 
in the early stages. 


Speciation and Adaptation from 
Darwin to Dobzhansky 


Appreciation of the connection between adapta- 
tion and speciation began with Darwin when a 
morphological concept of species largely pre- 
vailed. In On the Origin of Species, Darwin wrote 
that “TI look at the term species, as one arbitrarily 
given for the sake of convenience to a set of 
individuals closely resembling each other...” and 
“The amount of difference is one very important 
criterion in settling whether two forms should be 
ranked as species or varieties” (/). Under this 
view, speciation is defined as the accumulation of 
sufficiently many differences between popula- 
tions to warrant their classification as separate 
taxonomic species. Darwin understood the im- 
portance of reproductive barriers between species 
(J), but the study of speciation after the pub- 
lication of this work focused mainly on the evo- 
lution of species differences, particularly of 
morphological traits but also of behavioral and 
other phenotypic traits. 

Under this Darwinian perspective, linking 
speciation with adaptation was relatively straight- 
forward, requiring only a test of whether phenotyp- 
ic differences between species were caused by 
natural selection. For example, at the American 
Association for the Advancement of Science 1939 
speciation symposium [the last major symposium 
on speciation before the biological species concept 
(7)], an extensive comparative and biogeographic 
study showcased instances in which derived mor- 
phological and life history forms of fishes had 
arisen over and over again from the same ancestral 
type (12). The repeated, parallel origin of non- 
parasitic lamprey in streams from the same migra- 
tory, parasitic ancestor showed that “Again and 
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again the parasitic lampreys have evolved into non- 
parasitic forms...correlated with life in small 
streams, where a suitable food supply in the way 
of large fish is scarce or seasonal” (/2). When cor- 
related with environmental factors, such repetition 
is unlikely to result from chance; environmental 
selection pressures must therefore be the cause of 
speciation. “As a result of our recent studies on 
fishes...weight is constantly being added to the 
theory that speciation is...under the rigid control of 
the environment” (/2). However, this case is only 
referring to the origin of morphological species. 

The turning point for speciation studies came 
with the modern concept of speciation “Species 
separation is defined as a stage of the evolu- 
tionary process at which physiological isolat- 
ing mechanisms become developed” (6) (here, 
“physiological” is interpreted to mean evolved 
reproductive isolation between populations, as 
distinct from geographical barriers to interbreed- 
ing). Subsequently, species were defined as 
“groups of interbreeding natural populations that 
are reproductively isolated from other such 
groups” (7). From this point on, the study of 
speciation was the study of the evolution of 
reproductive isolation (3). Progress up to then in 
understanding the link between morphological 
speciation and adaptation was largely forgotten, 
its contributions uncertain under the new concept. 

The biological species concept must surely 
have made it more difficult to investigate any link 
between speciation and natural selection. T. 
Dobzhansky (/3) suggested that the genes under- 
lying differences between populations in ordinary 
phenotypic traits were unlikely to be the basis of 
reproductive isolation. He later changed his mind, 
but at the time this viewpoint, and the generally 
greater difficulty of studying reproductive isola- 
tion than morphology, must have discouraged 
many from pursuing the connection. Virtually no 
research effort followed that tested the role of 
adaptation in speciation. 


Models of Speciation by Selection 


The topic of natural selection in speciation is once 
again receiving attention. The two most general 
hypotheses involving selection are ecological and 
mutation-order speciation. Ecological speciation 
is defined as the evolution of reproductive iso- 
lation between populations by divergent natural 
selection arising from differences between eco- 
logical environments (2, 8, 9, /4). It predicts that 
reproductive isolation should evolve between 
populations adapting to contrasting environments 
but not between populations adapting to similar 
environments. The basic idea has been around for 
a while (7), although it was tested only recently. 
The agents of divergent selection are extrinsic and 
can include abiotic and biotic factors such as food 
resources, climate, habitat, and interspecies inter- 
actions such as disease, competition, and behay- 
ioral interference. Ecological speciation can lead 
to the evolution of any type of reproductive 


A Gambusia 


Fig. 1. (A) Example of ecological speciation. Repeatedly and 
independently, the mosquito fish, Gambusia hubbsi, inhabiting 
blue holes in the Bahamas has evolved a larger caudal region and 
smaller head in the presence of predators (top) than in their 
absence (bottom) (29). In laboratory trials, the probability of two 
individuals mating was higher when they were from different 
populations having the same predation environment (and similar 
body shape) than when they were from opposite predation 
environments. [Photo credit: Brian Langerhans (29)]. (B) Example 
of reproductive isolation evolving under the mutation-order 
mechanism. Male-fertile (left) and male-sterile (right) flowers of 
F2 hybrids between an Oregon population of monkey flowers (M. 
guttatus) having a cytoplasmic male sterility element and nuclear 
restorer and a closely related species (1. nasutus) having neither 
(46, 47). Both flowers shown have M. guttatus cytoplasm. The 
flower on the left also has the nuclear restorer, whereas the one on 
the right, with undeveloped anthers, lacks the restorer. [Photo 


credit: Andrea Case (47)] 


isolation, including premating isolation, hybrid 
sterility, and intrinsic hybrid inviability as well as 
extrinsic, ecologically based pre- and postzygotic 
isolation. Speciation by sexual selection is 
ecological speciation if ecologically based diver- 
gent selection drives divergence of mating 
preferences, for example by sensory drive (/5). 
In accordance with (/0), mutation-order spe- 
ciation is defined as the evolution of reproductive 
isolation by the fixation of different advanta- 
geous mutations in separate populations expe- 
riencing similar selection pressures. Whereas 
different alleles are favored between populations 
under ecological speciation, the same alleles 
would be favored in different populations under 
mutation-order speciation. Divergence occurs any- 


way because, by chance, the pop- 
ulations do not acquire the same 
mutations or fix them in the same 
order. Divergence is therefore sto- 
chastic but the process is distinct 
from genetic drift. It can occur in 
both small and large (though not 
infinite) populations. Selection 
can be ecologically based under 
mutation-order speciation, but 
ecology does not favor diver- 
gence as such. It can lead to the 
evolution of any type of repro- 
ductive isolation, with the excep- 
tion of ecologically based pre- and 
postzygotic isolation. 

Speciation resulting from in- 
tragenomic conflict such as mei- 
otic drive or cytoplasmic male 
sterility (Fig. 1B) is likely to be 
mutation-order speciation be- 
cause, by chance, the initial muta- 
tions causing drive and those 
countering it are unlikely to be 
the same in separate populations. 
Speciation by sexual selection is 
mutation-order speciation if di- 
vergence of mate preferences or 
gamete recognition occurs by the 
fixation of alternative advanta- 
geous mutations in different pop- 
ulations, as by sexual conflict 
(16). Divergence in song and 
other learned components of be- 
havior under purely social selec- 
tion, not molded by selection for 
efficient signal transmission (5), 
is the cultural equivalent of the 
mutation-order process. Addition- 
al scenarios are elaborated in (5). 

Both models of speciation, 
ecological and mutation-order, 
are theoretically plausible, and 
only data can determine their rel- 
ative importance in nature. The 
key is to figure out by which 
mechanism reproductive isolation 
first evolved (3). Once the earliest genetic differ- 
ences have accumulated between populations by 
either process, subsequent mutations might be 
favored in one population and not the other 
because of epistatic interactions with genetic 
background (/0). Hence, epistasis, including that 
producing Dobzhansky-Muller incompatibilities 
in hybrids between species (3), can result from 
either ecological or mutation-order speciation. 

Speciation can be rapid under both speciation 
models, because alleles are driven to fixation by 
natural selection in both cases. However, under 
the mutation-order process, the same alleles, if 
present, would be favored in every population, at 
least in the early stages of divergence. For this 
reason, mutation-order speciation is difficult when 
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there is gene flow, because gene flow increases the 
possibility that favorable mutations occurring in 
one population will spread to other populations, 
preventing divergence (/7, /8). Any process 
resulting in low levels of gene flow, including 
selection, facilitates subsequent divergence by the 
mutation-order process (/9). In contrast, ecological 
speciation can proceed with or without gene flow, 
although it is easiest when gene flow is absent. 

Experiments with laboratory populations of 
Drosophila and yeast demonstrate the plausibility 
of ecological speciation. In those instances when 
measurable pre- and postmating reproductive 
isolation evolved, it was greater between lines 
subjected to different environments than between 
lines raised under homogeneous conditions (20, 2/). 
Laboratory experiments on various microbes main- 
tained under homogeneous conditions for many 
generations have detected genetic divergence con- 
sistent with the mutation-order process (22), but 
effects on reproductive isolation have not been 
explored. 

Two approaches investigate the mechanisms of 
speciation by natural selection in nature. The 
bottom-up approach involves (i) genetic mapping 
of reproductive isolation between closely related 
species, (ii) testing whether discovered genes 
exhibit a genomic signature of positive selection, 
and (iii) identifying the phenotype and source of 
fitness effects of alternative alleles at selected loci. 
The approach has been hugely successful in 
identifying major genes implicated in hybrid 
inviability (Hmr, Lhr, Nup96), sterility (Odsh, 
JYAlpha), and sexual isolation (ds2) between 
Drosophila species. Most of these genes show 
molecular signatures of positive selection, proving 
natural selection’s role (3), provided that fixation 
occurred before complete reproductive isolation 
rather than afterward. The top-down approach 
involves identifying (i) the phenotypic traits un- 
der divergent selection, (1i) those traits associated 
with reproductive isolation, and (iii) the genes 
underlying traits and reproductive isolation. Step 
(iil) has been challenging under both approaches 
but is needed to understand how selection has led 
to reproductive isolation. 


Ecological Speciation 


Evidence for ecological speciation has accumu- 
lated from top-down studies of adaptation and 
reproductive isolation [reviewed in (2, 8, 9)]. We 
now know of many real species that have, at least 
in part, evolved by divergent natural selection 
between environments. The connections between 
selection on ordinary phenotypic traits and repro- 
ductive isolation are often strong and straight- 
forward. It follows that much of the genetic basis 
of reproductive isolation should involve ordinary 
genes that underlie differences in phenotypic 
traits. But we still know little about the genetics 
of ecological speciation. 

One line of evidence comes from tests of 
parallel speciation, whereby greater reproductive 


isolation repeatedly evolves between indepen- 
dent populations adapting to contrasting environ- 
ments than between independent populations 
adapting to similar environments (20, 23). A 
major challenge in applying the test to natural 
populations is to eliminate the possibility that each 
ecotype has originated just once and has spread to 
multiple locales. This is difficult because gene 
flow of neutral markers between closely related 
but nearby populations can result in the false 
appearance of multiple independent origins of 
these populations when evaluated by phylogenies 
(3, 24). However, multiple origins are supported in 
several examples of parallel speciation, including 
the sympatric benthic-limnetic species pairs of 
threespine stickleback in young lakes of British 
Columbia (25, 26), the repeated origin of diver- 
gent marine and stream populations of threespine 
stickleback around the Northern Hemisphere (27), 
ecotypes of Timema walking stick insects living 
on different host plants (28), Littorina marine snail 
ecotypes inhabiting different zones of the intertidal 
(24), and mosquito fish inhabiting blue holes with 
and without fish predators in the Bahamas (29) 
(Fig. 1A). In these studies, it was shown that males 
and females are more likely to mate if they are of 
the same ecotype, regardless of relatedness as 
indicated by phylogenetic affinity. 
Ecological speciation is also 
supported by examples of premat- 
ing reproductive isolation in which 
individuals choose or preferentially 
encounter mates on the basis of 
phenotypic traits that are under 
ecologically based divergent selec- 
tion. Examples include assortative 
mating by host choice in insects, 
body size and coloration in fish, 
beak size in birds, pollinator pref- 
erences for specific phenotypic 
floral traits, and variation in flower- 
ing time—traits inferred to be under 
divergent selection between environ- 
ments [see examples in (8, 30, 3/)]. 
Ecologically based divergent 
selection has also been directly 
measured, as shown by reduced fit- 
ness of each ecotype in the envi- 
ronment of the other [immigrant 
inviability; reviewed in (3/, 32)] 
and by reduced fitness of hybrids in 
the parental environments [extrin- 
sic postzygotic isolation (33)]. For 
example, each of the coastal peren- 
nial and inland annual races of the 
monkey flower (Mimulus guttatus) 
along the west coast of North 
America has low fitness when 
transplanted to the habitat of the 
other (37). This is an example of 
active selection on phenotypic dif- 
ferences, and it also constitutes di- 
rect reproductive isolation because 
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it is an evolved barrier to gene flow between 
parental populations. Multiple traits are probably 
involved, including flowering time and tolerance 
of salt and drought. This type of reproductive 
isolation is context-dependent and is weakened in 
intermediate environments. On the other hand, 
active selection favors the evolution of ever-greater 
differences between populations, which may 
strengthen the barrier to gene flow (20). 

It is unclear how much reproductive isolation 
typically evolves by ecologically based divergent 
selection in nature. We can approximate an answer 
from estimates of the combined contribution of 
active selection on traits and trait-based assortative 
mating, as compared with other forms of re- 
productive isolation (Fig. 2 and table S1). These 
estimates are incomplete because individual 
studies may lack data on components of repro- 
ductive isolation, separate components may not be 
independent, and the strength of barriers between 
species may not be symmetric (34). Nevertheless, 
compilation of the data shows that the amount of 
reproductive isolation attributable to active selec- 
tion and trait-based assortative mating is at least as 
strong, on average, as the amount from compo- 
nents of reproductive isolation lacking identifiable 
causes (Fig. 2). The unidentified component of 
speciation, if built by selection and not genetic 


Components — divergent selection 


Components — unknown cause 
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0.2 04 06 08 1 
Cumulative reproductive isolation 


Fig. 2. Estimates of the magnitude of reproductive isolation 
resulting from divergent selection components (top), compared 
with other components lacking identifiable causes (bottom). 
Divergent selection components include those attributable to 
active selection on traits (immigrant inviability and extrinsic 
postzygotic isolation) and to trait-based assortative mating (habitat 
preference, floral isolation, and breeding time). The unattributed 
components include intrinsic hybrid inviability, sexual selection 
against hybrids, pollen competition, and reduced hybrid fecundity. 
Data were taken from (32, 31) (table $1). A negative value 
indicates that hybrids had higher fitness than the parental species 
for at least one component of postzygotic isolation. One data value 
of —2.66 was left out of the bottom panel. 


www.sciencemag.org SCIENCE VOL 323 6 FEBRUARY 2009 


739 


Downloaded from www.sciencemag.org on February 6, 2009 


740 


Speciation 


drift, could be the result of either ecological or 
mutation-order mechanisms. 

These examples indicate a growing knowledge 
of the mechanisms of selection and its conse- 
quences for reproductive isolation. At this point, the 
most glaring deficiency is our knowledge of the 
impact of selection on genes. Optimistically, pro- 
gress is being made with genetic mapping to identify 
quantitative trait loci (QTLs) and genes or regu- 
latory control regions that affect individual pheno- 
typic traits on which components of reproductive 
isolation depend. Examples include the yup QTL, 
which affects flower color differences between the 
monkey flowers, Mimulus cardinalis and M. lewisii 
(35). Swapping alleles of this QTL between the 
species with repeated backcrossing resulted in shifts 
in pollinator preference and, hence, indirectly af- 
fected premating isolation. Survival and salt toler- 
ance of second-generation hybrids between the 
sunflowers Helianthus annuus and H. petiolaris 
transplanted to the salt marsh habitat of their hybrid 
descendent species (H. paradoxus) mapped strong- 
ly to a QTL identified as the salt tolerance gene 
CDPK3 (36). 

Another form of investigation involves the 
analysis of genome scans of ecologically different 
populations and species. These scans compare 
allelic variation within and between populations at 
many marker loci spaced throughout the genome 
(37). Markers that show excessive differentiation 
between populations (outliers) may indicate selec- 
tion on nearby genes. The method is particularly 
informative when applied to populations with 
ongoing hybridization, because outlier markers 
may identify points in the genome that resist the 
homogenizing influence of gene flow, perhaps 
indicating genomic regions under divergent selec- 
tion. However, sets of genes that diverged under a 
mutation-order process can produce the same 
pattern (17, 18), which makes analysis of such 
studies more difficult. Clues to whether ecologi- 
cally based divergent selection is involved are 
gained if outliers at the same genomic locations 
turn up repeatedly in scans between populations 
that inhabit contrasting environments (38) and by 
identifying phenotypic traits under divergent 
selection that map to those locations in the genome 
(36, 39, 40). As genomic resources increase for 
more species, it will be possible to measure natural 
selection directly on genomic regions of interest by 
transplanting otherwise relatively homogenous ex- 
perimental populations containing alternative alleles 
into the environments of the parent species (35). 


Mutation-Order Speciation 


Mounting evidence for divergent selection in 
speciation does not diminish the potential role of 
mutation-order divergence. It may be that the 
mutation-order process is more difficult to detect, 
or perhaps we have not looked hard enough at 
species with only small ecological differences (5). 
We still do not know much about the selective 
factors causing mutation-order speciation. 


Evidence for mutation-order speciation comes 
from instances in which reproductive isolation 
apparently evolved as a by-product of conflict 
resolution between genetic elements within indi- 
viduals (intragenomic conflict), such as cyto- 
plasmic male sterility in hermaphroditic plants 
(Fig. 1B), and genetic elements conferring 
meiotic drive. Under both mechanisms, a muta- 
tion arises that can distort representation in 
gametes and spreads in a selfish manner, even 
though such an element reduces overall fitness of 
the organism that bears it. This, in turn, places 
selection on mutations in other genes that counter 
the selfish element’s effects and restore more 
equal genetic representation in gametes. Distorter 
and restorer mutations are unlikely to be the same 
in different populations regardless of environ- 
ment; thus the process leads to divergence. The 
mismatch between the distorter in one population 
and the restorer in the other can result in hybrid 
sterility or inviability and, thus, reproductive 
isolation (3, 41). Numerous examples of selfish 
elements, such as those observed in cytoplasmic 
male sterility of plants, support these hypotheses 
(42, 43). In addition, partial reproductive isolation 
generated by meiotic drive has been identified in 
Drosophila [reviewed in (3, 4/)]. Sexual conflict 
is also expected to lead to mutation-order spe- 
ciation, but there are few compelling examples 
(3). The contribution by these mechanisms to 
speciation is still uncertain, however. The alleles 
responsible for meiotic drive and cytoplasmic 
male sterility may be prevented from spreading to 
fixation because selection on such elements is 
frequency-dependent (43) and because restorer 
alleles arise and weaken selection on the distorter 
elements (44). Second, if divergent populations 
come into secondary contact, the alleles within 
each population causing cytoplasmic male steril- 
ity or meiotic drive (and the corresponding 
restorer alleles) will spread between the popula- 
tions by gene flow, eliminating that component of 
reproductive isolation (43). Hence, for these 
mechanisms to contribute to speciation, the fitness 
of hybrids must be reduced to very low levels, or 
other incompatibilities must arise that interact 
with these genes to prevent their spread after 
secondary contact. 


Conclusions 


Our understanding of the role of natural selection 
in speciation has come a long way since Darwin’s 
time. If he were here to witness, he would most 
likely be staggered by the discoveries of genes 
and molecular evolution and astonished at the 
prospect that evolutionary conflict between genes 
could generate reproductive isolation (45). Most- 
ly, [expect that he would be chuffed by mounting 
evidence for the role of natural selection on 
phenotypic traits in the origin of species. This is 
really what On the Origin of Species was all 
about. Between 1859 and the present, the general 
acceptance of the biological species concept 


altered the focus of speciation studies. Yet, the 
discovery that reproductive isolation can be 
brought about by ecological adaptation in ordi- 
nary phenotypic traits bridges Darwin’s science 
of speciation and our own. 

The most obvious shortcoming of our current 
understanding of speciation is that the threads 
connecting genes and selection are still few. We 
have many cases of ecological selection generat- 
ing reproductive isolation with little knowledge of 
the genetic changes that allow it. We have strong 
signatures of positive selection at genes for repro- 
ductive isolation without enough knowledge of 
the mechanisms of selection behind them. But we 
hardly have time to complain. So many new 
model systems for speciation are being developed 
that the filling of major gaps is imminent. By the 
time we reach the bicentennial of the greatest 
book ever written, I expect that we will have that 
much more to celebrate. 
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The Bacterial Species Challenge: 
Making Sense of Genetic and 
Ecological Diversity 
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The Bacteria and Archaea are the most genetically diverse superkingdoms of life, and techniques 
for exploring that diversity are only just becoming widespread. Taxonomists classify these 
organisms into species in much the same way as they classify eukaryotes, but differences in their 
biology—including horizontal gene transfer between distantly related taxa and variable rates of 
homologous recombination—mean that we still do not understand what a bacterial species is. This 
is not merely a semantic question; evolutionary theory should be able to explain why species 
exist at all levels of the tree of life, and we need to be able to define species for practical 
applications in industry, agriculture, and medicine. Recent studies have emphasized the need to 
combine genetic diversity and distinct ecology in an attempt to define species in a coherent and 
convincing fashion. The resulting data may help to discriminate among the many theories of 
prokaryotic species that have been produced to date. 


he species debate in microbiology is not 
[os about a human desire to catalog bac- 

terial diversity in a consistent manner, but 
is also a fundamental argument because of what it 
reveals about our ignorance of how evolutionary 
forces form, shape, and extinguish bacterial ge- 
netic lineages, of the mechanisms of differen- 
tiation between subpopulations sharing common 
descent, and of the process of adaptation to new 
niches and changing environments. Animal spe- 
cies are defined by their morphological and be- 
havioral traits and by their ability or inability to 
interbreed, but such categories cannot easily be 
applied to the Bacteria or Archaea (or indeed to 
many eukaryotic microbes). Instead, taxonomists 
have been forced to rely on biochemical tests and 
limited morphological characteristics for this pur- 
pose. Naturally, biochemical characters have been 
selected for the convenience of taxonomists; they 
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reflect only a tiny subset of those characters that 
allow bacteria to use different resources in the 
environment, and only capture a small fraction of 
the true diversity in this superkingdom of life. 
More recently, molecular methods [particularly 
DNA-DNA hybridization and ribosomal RNA 
(tRNA) sequencing] have helped to define species, 
but these methods have serious limitations and 
cannot reliably assign a large collection of similar 
strains to species (e.g., rRNA sequences are too 
conserved to resolve similar species). rRNA se- 
quence surveys have, however, revealed the extra- 
ordinary variety of microbial life, much of it 
uncultured (/). Beyond this, taxa too similar to be 
distinguished and circumscribed by rRNA se- 
quences have revealed further diversity through 
multilocus sequence analysis (MLSA) (2) and 
metagenomic studies (/), and this diversity needs 
to be explained by theory. Thus, practical dif- 
ficulties, lack of theory, and observations of vast 
amounts of as yet unclassified microbial diversity 
have all fueled the controversy of how one de- 
fines a bacterial species (3—8). 


Genetic Clustering 

Darwin commented that “all true classification 
is genealogical” [(9), p. 404]. Taxonomists have 
thus used sequence relatedness to define cutoff 


same or different species. The overall genetic 
relatedness of isolates may be measured by the 
extent of DNA hybridization between them, and 
those that show 70% or more DNA hybrid- 
ization are defined as the same species (2, /(0). 
Such cutoffs imply that sequences that cluster 
together with a certain amount of similarity 
must be from the same species, and moreover 
that this cutoff value is applicable to all groups 
of bacteria or archaea. Recent MLSA studies, 
which use the concatenated sequences of mul- 
tiple housekeeping genes to discern clustering 
patterns among populations of closely related 
taxa, suggest that species defined by taxono- 
mists in many cases correspond to well-resolved 
sequence clusters. However, these studies also 
show that there is no universal cutoff or descrip- 
tor of clusters that characterizes a species. Fur- 
thermore, inspection of the clusters does not 
always clearly reveal which level in the hierarchy 
is more fundamental than any other (Fig. 1) (7). 

As an example, Fig. 1A shows the relation- 
ships among multiple isolates of three closely 
related streptococcal species. Streptococcus 
pneumoniae is a major human pathogen, S. mitis 
is a commensal bacteria with a history of taxo- 
nomic uncertainty (//), and S. pseudopneumoniae 
is arecently described organism of uncertain status 
that nonetheless corresponds to a distinct cluster in 
these data (/2). There are striking differences in 
the amount of sequence diversity observed within 
homologous housekeeping genes in these named 
species, ranging from 1.2% for S. pneumoniae to 
3.0% for S. pseudopneumoniae and up to 5.0% for 
S. mitis. The distance between two randomly se- 
lected S. mitis genotypes is similar to the average 
distance between S. pneumoniae and S. pseudo- 
pneumoniae genotypes (5.1%) (2). This implies 
that the use of a fixed level of sequence divergence 
for differentiating species would tend to either 
rejoin S. pneumoniae and S. pseudopneumoniae, 
or break up S. mitis so that nearly every isolate was 
a species of its own. This is clearly unsatisfactory. 


Habitats and Ecological Differentiation 


A clear natural criterion to identify clusters of 
evolutionary importance, which we might want 
to call species, is to find ecological features that 
distinguish them from close relatives. Among 
pathogens, the ability to cause a distinctive dis- 
ease has historically been used to define species, 
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Fig. 1. Multilocus sequence analysis of closely related species. (A) Radial 
minimum evolution tree constructed using MEGA4, showing clusters among 97 
isolates of four Streptococcus species identified as indicated. The tree was built 
using concatenates of six housekeeping loci, resulting in a total of 2751 positions 
in the final data set (2). Distances were calculated as the percentage of variant 
nucleotide sites. The mean distance within the clusters, calculated by MEGA4, is 
shown. To the right, the pneumococcal cluster is shown at larger scale, and 
putative subclusters are indicated in dark gray, purple, and green. (B) Ecological 
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associations of Vibrionaceae sequence clusters (13). Habitats (colored dots) were 
estimated as differential distributions of groups of closely related strains among 
samples (size fractions enriched in different environmental resources). Clusters 
associated with named species are evident, and in most cases species show a clear 
predilection for one of the habitats. The exception is V. splendidus, which breaks 
up into many closely related ecological populations. Asterisk denotes that trees 
based on additional loci indicate that the placement of V. panecida within V. 
splendidus may be an artifact of horizontal gene transfer at the Hsp60 locus. 


but pathogens constitute only a minute fraction of 
overall bacterial diversity. Mapping of bacterial 
diversity onto environmental resources indicates 
that closely related groups of bacteria can be eco- 
logically divergent. For example, fine-scale re- 
source partitioning has been observed among 
coastal Vibrio populations coexisting in the water 
column (/3). Partitioning was discovered because 
strains were collected from distinct, ecologically 
informative samples, and the phylogenetic struc- 
ture of the ecologically differentiated populations 
was superimposed on their habitats. Habitats were 
defined using an empirical modeling approach. 
This analysis revealed high levels of specialization 
for some populations (e.g., Vv ordalii is only found 
as single free-swimming cells), whereas others are 
more generalist (Fig. 1B) and can colonize a wide 
variety of surfaces, including organic particles and 
zooplankton in the water column (/3). Most of the 
predicted Vibrio populations are deeply divergent 
from each other, and in many cases are congruent 
with named species; however, V. splendidus is a 
notable exception and splits into numerous closely 
related groups with distinct ecological preferences, 
presumably indicating recent ecological radiation 
from a sympatric ancestral population (/3). Thus, 
genetic clusters that correlate with ecology can be 
discerned. 

What do the genetic data tell us about mech- 
anisms of population differentiation and the 
evolutionary history of the microbes in question? 
That bacteria are organized into genetic clusters is 


not, per se, a very interesting observation; many or 
most models of a population reproducing with a 
small amount of mutation will eventually produce 
populations consisting of clusters of related orga- 
nisms, irrespective of the details of the evolution- 
ary forces or ecological differentiation. A more 
substantial observation is that there is very little 
neutral diversity in many populations of microbes, 
from which we may infer some features of the 
selective landscape. Neutral diversity is the 
amount of polymorphism that is evident in non- 
coding regions or results in synonymous sub- 
stitutions. One common measure of neutral 
diversity is the effective population size N., de- 
fined as the size of a population evolving in the 
absence of selection that would generate as much 
neutral diversity as is actually observed. Esti- 
mates of N, for bacteria range from 10° to 10” 
(/4-18). To put this into context, the numbers of 
Vibrio cells per cubic meter of seawater in tem- 
perate coastal regions range from 10° to 10° (19), 
which suggests vast census population sizes 
(>10°°). This observation—a mismatch of many 
orders of magnitude between effective population 
size and census population size (true of most 
bacteria studied to date)—was originally used to 
counter claims of neutrality and instead argue that 
all genetic variation was adaptive (20, 2/). 
However, there are several different mechanisms 
that can explain this mismatch (Fig. 2). 
Whatever mechanisms are driving the differ- 
entiation of bacteria into clusters, they must re- 


strict the accumulation of neutral diversity. The 
first proposed mechanism was based on artificial 
selection experiments with bacteria grown for 
extended periods under stable conditions in che- 
mostats, which showed repeated selective sweeps 
in which the whole genome was thought to 
hitchhike to fixation along with an advantageous 
mutation (periodic selection) (22). Selective sweeps 
can purge almost all genetic diversity in the pop- 
ulation and thus constitute a candidate mecha- 
nism for reducing neutral variation (23). 


Niches and Ecotypes 


To extend this model, one can consider multiple 
ecological niches characterized by the selective 
advantages they confer to specific genes. This is 
the ecotype model, where genes adapted to 
specific niches cause selective sweeps within 
those niches but not in other niches. In this way 
the population will undergo adaptation and dif- 
ferentiation while maintaining relatively low levels 
of neutral diversity, as selective sweeps confined 
to each ecotype regularly purge the population 
of any diversity that might have accumulated 
(Fig. 2A). Crucially, what neutral diversity we do 
observe is predicted to be associated with adapt- 
ive traits. The ability of such selective sweeps to 
limit the effective population size has been rec- 
ognized for some time (/7, 23), and this model 
has been substantially developed by Cohan and 
colleagues (4, 16, 24). Because it links patterns of 
genetic differentiation with adaptation, and makes 
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reference to the unifying biological principles of 
selection and niche partitioning, the ecotype has 
rightly become popular as a framework within 
which to discuss bacterial evolution, speciation, 
and ecology. 

The ecotype model (4, /6, 24) predicts that 
common ancestry will be preserved among bac- 
terial populations within niches (which should be 
monophyletic), and thus predicts that ecotypes 
are coherent self-contained gene pools. As a re- 
sult, it has been suggested that ecotypes should 
be considered as putative or actual species, de- 
pending on the level of genetic differentiation 
from the ancestral population. This model there- 
fore has the advantage of providing a mechanistic 
understanding of the evolutionary processes, as 
well as an organizing principle for classifying 
species, that is based on experimental observa- 
tions of bacterial populations. 

However, these observations of repeated se- 
lective sweeps were made in chemostats, whereas 
natural environments are markedly unstable and 
diverse. How would one detect the presence of 
selective sweeps in natural bacterial populations? 
The most conclusive examples come not from 
bacteria but from RNA viruses, which mutate at 
much higher rates than DNA-based life forms. It 
has been established from sequences collected 
over many years that the population structure of 
the human influenza virus is predominantly driven 
by repeated selective sweeps (25) and that the 
resulting effective population size N. (<100) is 
very much smaller than observed for bacteria. 
The use of longitudinal ecological and genetic 
data to distinguish between competing models 
of evolution has a long pedigree in eukaryotic 
biology (26). On the basis of these analogies, 
any inference of a population structure driven by 
selective sweeps would require good longitudi- 
nal data from natural bacterial populations, as 
well as observations of episodic crashes in 
diversity causally associated with genetic changes 
and not associated with changes in ecological 
covariates. 


Bottlenecks, Metapopulations, and 
Local Extinctions 


The essential element of the ecotype model with 
respect to limiting neutral diversity is not niche 
adaptation per se, but rather the effective bottle- 
neck caused by the replacement of the whole 
population by descendants from a single indi- 
vidual and the resulting extinction of all other 
lineages (Fig. 2A). Other mechanisms that induce 
or involve regular population bottlenecks will 
also restrict neutral diversity. Metapopulation 
structure, in which the population is divided into 
patches and where individuals disperse between 
patches, can generate very low effective popu- 
lation sizes if patches turn over (i.e., if patches 
are only intermittently able to support bacterial 
growth, and if a small number of bacteria are 
dispersed to colonize empty patches) (Fig. 2B) 
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Fig. 2. Different models of microbial evolution that lead to low values of N.. (A) The ecotype 
model of bacterial population differentiation. The tree shows a single bacterial lineage that dif- 
ferentiates into two sublineages (E1 and E2) that differ in some aspect of their ecology. Periodic 
selection (a selective sweep) occurs at the points marked by asterisks and eliminates almost all of 
the diversity that has arisen since the last episode of periodic selection, which is shown by the 
dashed branches (diversity purged by periodic selection) or solid branches (existing diversity) on 
the tree. As the two populations are ecologically distinct (i.e., ecotypes), periodic selection in one 
sublineage does not influence diversity in the other sublineage and vice versa. Each ecotype can 
therefore diverge to become separate species. Reproduced from (24) with permission. (B) A meta- 
population. Patches of varying size (gray circles) are vacant (empty) or may be colonized by a 
single genotype randomly acquired from another patch. Strains may diversify within a patch (as 
shown by different colors representing distinct genotypes), which may colonize empty patches as 
described above. A characteristic of this sort of metapopulation is patch turnover, in which patches 
occasionally become unable to support colonization and their inhabitants are removed (solid gray 
circles). (C) A neutral model with small population size. Different genotypes (different colors) arise 
by mutation or recombination and increase or decrease in the population by random drift. For 
some purposes, this simple model is an adequate effective description of the more complex pro- 
cesses represented in (A), (B), and (D), and of other more complex evolutionary models not de- 
scribed in this review. (D) Predator-prey dynamics and population bottlenecks. Regular population 
bottlenecks can drastically shrink the effective population size. In this case, bacteria-phage 
predator-prey dynamics are simulated with a classical Lotka-Volterra model, which can generate 
oscillations in population size of any amplitude. Population sizes and time axes are in arbitrary 
units for illustrative purposes only. 


(27). This structure well describes the situation 
for parasites, which can colonize a host but are 
then forced to move on because the host develops 
immunity or dies (/7). It also describes any 
situation where bacteria use a limited resource 
intensively for short bursts, followed by dispersal 
to new resource patches (e.g., colonization of 
organic particles in seawater by Vibrio popula- 
tions). This metapopulation model is fundamen- 
tally different from the ecotype model because it 


does not predict an association between neutral 
diversity and adaptive traits. 

The relevance of the metapopulation model to 
the species question is that, although highly ide- 
alized and simplified, it may capture some of the 
effects of complexity and instability of actual 
ecosystems on population structure. Selective 
sweeps are predicted to be inevitable in simple, 
stable environments but not in complex meta- 
populations [a point partly addressed in (28)]. 
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A metapopulation may evolve, differentiate, and 
adapt without global selective sweeps. Diversity 
lost by a local selective sweep in one patch may 
be rescued and reintroduced from other patches. 
The ecotype model, with its predicted mono- 
phyletic relationship between niche and geno- 
type, may therefore not be an appropriate model 
of speciation in complex ecosystems. 


Choosing Between Models 


It has proven difficult to discriminate between 
models of population differentiation that focus on 
ecotypes or metapopulations. For example, the 
ecotypic structure of a soil Bacillus has been 
modeled to predict a priori which sequence 
clusters were ecotypes, and hence which ones 
should be associated with specific ecological 
properties (/6). Some clusters are associated 
with certain phenotypic traits, such as a propen- 
sity to grow on shady north-facing slopes or sun- 
ny south-facing slopes. However, this model 
fitted no better (and in fact slightly worse) than a 
version of the model with several subpopulations 
and diversity generated only by neutral drift. This 
version of the model was dismissed because of its 
association with a very low estimate of popula- 
tion size (/4). However, estimates of effective 
population size NV, are often grossly disconnected 
from census population sizes. It has proven very 
challenging to find models that successfully 
explain low estimated values of N. while pro- 
viding better predictions than models based on 
simple neutral drift. The analysis of Bacillus 
partly did this by predicting more ecotypes in 
the model than were observed using established 
ecological criteria, a hypothesis that can be 
tested. 

This problem of low power to detect selection 
(or, more accurately, to reject neutrality) is a very 
general problem in population genetics that does 
not negate the importance of adaptation in evo- 
lution, but rather suggests that more work is 
needed if we want model-based methods to dis- 
criminate among different biologically plausible 
explanations of genetic data. In Table 1 we pro- 
pose a scheme for performing analyses that could 
be used to test, develop, and validate different 
competing models more systematically. 


Homologous Recombination 


One specific challenge to models that invoke 
ecotypic structure involves a feature of bacterial 
evolution—homologous recombination—that 
we have not yet discussed. Bacterial reproduction 
does not involve the obligate reassortment of 
genetic material observed in most higher orga- 
nisms. However, recombination does occur in 
bacteria and archaea (29) and typically involves 
the replacement of a short piece of DNA with 
the homologous segment from another strain. 
Recombination becomes less probable with in- 
creasing sequence divergence between the donor 
and the recipient (30, 37), which reduces but does 


not eliminate recombination between closely re- 
lated species. Because of such interspecies recom- 
bination, any given isolate within a species is 
almost certain to contain at least some genetic ma- 
terial that is characteristic of other closely related 
species. Hence, whereas it was once thought that 
bacteria do not form species in the eukaryotic sense 
because they do not recombine at all (32), one 
current view is that they do not form species 
because they recombine too much (5). 

In asexual clonal organisms, even in the ab- 
sence of any selective pressure, clusters will 
spontaneously split into multiple lineages or 
“daughter” clusters (15). However, under certain 
circumstances recombination can prevent this, 
and we can hence divide the bacteria into “‘sexual” 
and “nonsexual” species. This effect, described at 
greater length elsewhere (/5), is summarized in 
Fig. 3, which shows the rate at which two clusters 
diverge over time—that is, the increase in the 
mean genetic distance between them. If this be- 
comes negative, then the two clusters will stop 
diverging and instead converge. The three exam- 
ples shown in Fig. 3 differ only in the rate of 
homologous recombination between the clusters, 
all other parameters being held constant. As re- 
combination increases, we see a distinction be- 
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tween a “clonal” organism in which clusters are 
predicted to diverge (the green line) and a 
“sexual” organism (the blue line) in which they 
are predicted to be held together by recombi- 
nation. For “sexual” species, the divergence of 
clusters requires a process that reduces the rate 
of recombination between them—for example, a 
period of allopatry or ecological differentiation. 
The speciation point is the amount of divergence 
between clusters that needs to accumulate to 
prevent them from returning to a single cluster if 
the barriers to recombination are removed. A 
recent study hypothesized that two related 
Campylobacter species are currently undergoing 
this process of merging into a single species as a 
result of changes in their environment (33). 

The above insights were reached using mod- 
els based on the assumption that genetic variation 
is neutral. Although this is obviously not always 
an appropriate assumption, it is plausible that the 
number of loci explicitly involved in adaptive 
ecological differentiation will be small, and thus 
that in an unstable landscape, genomic barriers 
to recombination will depend more on the accu- 
mulation of differences at neutral loci than at 
adaptive loci. The models also assumed a homo- 
geneous distribution of polymorphisms across 
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Fig. 3. The dynamics of cluster divergence. The figure summarizes some key results from (15) in a 
phase-space plot of the genetic dynamics of two populations, with recombination occurring be- 
tween them at a rate that is varied for the three different simulations. The y axis shows the rate of 
change of genetic distance between the clusters as a function of the genetic distance itself (x axis). 
When the rate of change is positive, the populations will diverge genetically; when negative, they 
converge. The direction of change for each scenario is shown by arrows color-coded to each 
scenario. For low recombination rates, the populations are effectively clonal and always diverge 
(green line). As the recombination rate increases, the cohesive effects of recombination slow the 
rate of divergence, until a threshold is passed (red line) and the populations become effectively 
sexual in the sense that the populations no longer diverge. For recombination rates above this 
level, the fate of the two populations will depend on how genetically distinct they are at the outset. 
If they are within the “speciation point,” then recombination will cause them to merge. If they are 
farther away than this “speciation point,” they will continue to diverge from each other. These 
curves are derived using the model described in (25). 
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the genome, and violation of this may alter the 
tempo and mode of these processes (34, 35). 


Illegitimate Recombination and 
Gene Content Variation 


Illegitimate recombination or gene acquisition is 
another unusual feature of bacteria. In this case, 
genes or clusters of genes are acquired that typ- 
ically have no homolog(s) in the recipient strain. 
The importance of this phenomenon is evident in 
the clear and ubiquitous signature of such events 
in the growing body of genomic data. These are 
identified by differences in the characteristics of 
the acquired DNA and that of the host strain, for 
example, in base composition or co- 
don usage; in most cases, the donor of 
the DNA in question is unknown. 
Gene acquisition leads to genomes 
being punctuated by stretches of 
foreign DNA. The largest of these 
(which may be many kilobases in 
length) were initially termed “patho- 
genicity islands,” because the new 
functions encoded by the imports were 
often involved in virulence, but a bet- 
ter term is “genomic islands” as the 
phenomenon is far from limited to 
pathogens (36, 37). Although it is hard 
to quantify the selective impact of im- 
porting any given gene(s) into a new 
background, the occasional ability to 
gain a new adaptation in this fashion— 
such as a new metabolic capability 
or a new mode of transmission for a 
pathogen—may be of enormous im- 
portance in terms of speciation. 
Perhaps even more striking is the 
amount of variation in gene content 
revealed by multiple genomes from 
the same species, which implies that 
gene acquisition occurs at a sur- 
prisingly high frequency. It is now 
commonplace to speak of the “core” 
genome, which encodes fundamental 
functions shared by all members of a 
species (and, it should go without 
saying, other related species), onto 
which is bolted the “auxiliary” or “accessory” 
genome, composed of genes and operons that 
may or may not be present in all isolates. It seems 
likely that such auxiliary genes help to determine 
the specific ecological properties of the organism. 
For example, a group of related Leptospirillum 
has recently been hypothesized to adapt to dif- 
ferent areas of an acid mine drainage system by 
shuffling of chromosome segments enriched in 
noncore genes (38, 39). We should, however, be 
aware that changes in core genes may also lead to 
ecological differentiation, a phenomenon well 
documented in experimental studies of bacteria 
growing in structured environments (40). 
Estimates vary, depending on the genomes 
that are available, but as little as 40% of genes 
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may be present in all sequenced genomes of a 
named species (4/7). We may consider genes 
within a named species as being characteristic of 
different levels of ecological specificity, rang- 
ing from highly conserved core functions that 
are essential for growth in all environments to 
loci that are involved with adaptation to a specific 
habitat. Some narrow niche-specific genes may 
be distributed across species, being transferred 
between them by mobile elements. The evolu- 
tionary fate of such genes may hence be only 
loosely coupled with that of any particular 
species or strain in which they are found, and 
they are maintained through selection by the 
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which individuals are similar enough, or inter- 
breed enough, that individual variant genes 
compete directly for reproductive success. Prac- 
tical advances building on this or other theoretical 
concepts will only come when these are de- 
veloped into explicit models and model-based 
algorithms that are tested and refined on a wide 
range of data. Alternatively, it may be sensible to 
suggest an ad hoc application of principles to 
different genera on the basis of their specific char- 
acteristics, including the extent of variation in 
gene content and recombination. In any case, no 
biologist would deny the importance of ecology 
to what we observe, but it may not be easy to 
incorporate it in a fashion that is 
convenient for taxonomists. None- 
theless, population geneticists may 
have little choice but to tackle the 
question of defining bacterial species 
or, at the very least, populations. 
Whether we are estimating effective 
population size from neutral diversity 
or choosing an appropriate set of 
strains to test for positive selection at 
a locus of interest, species definitions 
are implicit in much of the analytical 
toolkit of population genetics. 
Distinguishing among mecha- 
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Fig. 4. Differences between core and auxiliary genes. This schematic 
illustrates the relationships between three species in “ecotype space,” 
shown here in two dimensions, and a mobile gene common to all three. 
The areas occupied by the species are shown as solid lines in red, blue, 
and green. The part of the ecological space where the shared mobile gene 
is selected in each species is shown by a dashed purple line and overlaps 
all three species ranges. Examples of (circular) genomes from each species 
with and without the purple mobile element are also illustrated. Note that 
for each species, the locus is not selected for all isolates, and its evolu- 
tionary fate is uncoupled from that of each host species, because if one 
undergoes a selective sweep or goes extinct, the mobile gene may be 
reintroduced from one of the other species. Examples of such distributed 
loci include drug resistance determinants in pathogens (e.g., B-lactamase 
genes) and heavy metal resistance in environmental organisms. These 
genes may be transferred among strains and species by conjugative plas- 
mids or other mobile elements (including transducing phage). 


habitat to which each host strain is adapted. In 
the case of very mobile elements—for exam- 
ple, plasmids encoding resistance to antibiotics or 
heavy metals—the ecological specificity deter- 
mined by these accessory loci may have no link 
to the sequence clusters we observe using house- 
keeping genes (Fig. 4). 


Identifying Mechanisms and 

Delineating Species 

What do we want from bacterial species? Do we 
need theoretical consistency even at the expense 
of taxonomic practicality, incorporating both 
“clonal” and “sexual” populations into a single 
theoretical framework? One unifying theoretical 
concept is to consider species as the arena within 


nisms of population differentiation 
in bacteria ultimately comes down to 
testing the ability of different models 
to explain highly variable patterns 
within and between genetic-ecological 
clusters (Fig. 1). It is still unclear 
whether these patterns are main- 
tained by gene flow or selection, 
and what the effect of population 
structure is. The joint distribution of 
genetic and ecological data can be 
used, as described above for Vibrio 
species (/3), to define populations 
without making a strong theoret- 
ical commitment to either of these 
alternatives. One clear result from 
all of the studies discussed here is 
that the underlying theoretical ques- 
tions concerning species will not be 
answered in the absence of more detailed genetic- 
environmental mapping. Moreover, some guide- 
lines for the types of ecological studies that will 
be most informative are emerging. Most impor- 
tant, the ecological data collected must be rel- 
evant to the niche boundaries of the populations 
studied. And if genetic groups do not map ex- 
clusively onto sampling categories (as is likely 
to be the case), more complex statistical models 
will be needed to identify and describe the under- 
lying niche structure. Longitudinal studies that 
measure the dynamics of ecological associations 
over time will also be helpful to determine how 
transient natural habitats are, and thus how likely 
bottlenecks are to result. Finally, whole-genome 
sequences from entire populations of environ- 
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Table 1. A proposed strategy for developing and validating models of bacterial evolution that 
might eventually be used to classify genetic diversity data and provide a firm foundation for a 


bacterial species concept. 


1. Collect samples according to systematic ecological stratification. Focus on longitudinal studies, 
geographical studies, and measurement of physical and chemical gradients affecting bacterial 
growth. Consider biotic factors such as the presence of other competing bacteria or parasitic phage. 

2. For each isolate, sequence as much as possible and affordable (165 rRNA, MLSA, auxiliary 


genes, full genomes, etc.). 


3. Use empirical classification algorithms that use genetic and ecological data to jointly map isolates. 

4. To guide model formulation, use population genetic tests on observed clusters, focusing on 
tests for selection, population structure, and gene flow. 

5. Generate evolutionary models and simulate populations. 

6. Test, then reject or adapt, evolutionary models according to agreement between simulations 
and real populations; if necessary, return to step 1. 

7. For successful models, develop model-based methods for interpreting pure genetic data 
(without ecological covariates) and test on new data. 

8. If one or more validated models emerge, use these to classify genetic data and to develop 


bacterial species concepts. 


mental bacteria will be useful in dissecting the 
roles of the auxiliary and core genome in 
ecological differentiation. If after this process it 
emerges that some model or models are consistently 
validated for different study systems, these would 
inevitably form a good basis for identifying 
fundamental levels of clustering, or species. 

In the foregoing we have emphasized ecotype 
and metapopulation models, but there are others 
that deserve consideration—notably the epidemic 
clonal model (42) and the impact of phage epi- 
demics causing classic Lotka-Volterra boom-bust 
dynamics (43) illustrated in Fig. 2D—and it is 
possible, even likely, that more than one of these 
mechanisms may be relevant to any given 
problem in speciation and cluster formation. Dis- 
tinguishing among these mechanisms is the 
bacterial species challenge (Table 1), described in 
1991 by John Maynard Smith as follows: “Eco- 
typic structure, hitch-hiking, and localized recom- 
bination can explain the observed patterns of 
variation. The difficulty, of course, is that the 
model is sufficiently flexible to explain almost 
anything. To test the hypothesis of ecotypic 
structure, we need to know the distribution of 
electrophoretic types [1.e., genotypes] in different 
habitats” (17). 

Much research on bacterial species to date has 
come from studies on pathogens, where the cor- 
rect identification of species is crucial for accu- 
rate clinical diagnoses. However, for pathogens 
the identification of the multiple ecological 
niches within (for example) the nasopharynx or 
gut is difficult, and studies of the relationships 
between bacterial populations and ecology may 
be more fruitful for some environmental species 
where the categorization of niches is a more 
tractable enterprise. Hopefully, we will soon 
obtain richer data sets that map bacterial diversity 
onto ecology and provide a way to distinguish 
among various models of population differentia- 
tion and speciation, including those based on 
ecotypes or metapopulations. 
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Is Genetic Evolution Predictable? 


David L. Stern?* and Virginie Orgogozo7* 


Ever since the integration of Mendelian genetics into evolutionary biology in the early 20th century, 
evolutionary geneticists have for the most part treated genes and mutations as generic entities. However, 
recent observations indicate that all genes are not equal in the eyes of evolution. Evolutionarily relevant 
mutations tend to accumulate in hotspot genes and at specific positions within genes. Genetic evolution is 
constrained by gene function, the structure of genetic networks, and population biology. The genetic basis 
of evolution may be predictable to some extent, and further understanding of this predictability requires 
incorporation of the specific functions and characteristics of genes into evolutionary theory. 


ne hundred and fifty years ago, Charles 
Darwin and Alfred Russell Wallace pro- 
posed that biological diversity results 
from natural selection acting on heritable varia- 


tion in populations. Both Darwin and Wallace 
recognized the importance of heritable variation 
to evolutionary theory, but neither man knew the 
true cause of inheritance. Early in the 20th cen- 
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tury, the rediscovery of Mendel’s studies allowed 
for a formal mathematical treatment of alleles 
in populations, generating the field of popu- 
lation genetics. Population geneticists treated 
genes and alleles as generic entities, particles that 
were inherited and somehow caused variation in 
the appearance, behavior, and physiology of 
organisms—what we call collectively the pheno- 
type. This level of abstraction was appropriate 
given that a molecular understanding of gene 
function lay many decades in the future. Even with 
this rudimentary view of gene function, how- 
ever, population genetics greatly clarified how 
real populations evolve, and this theoretical un- 
derstanding spurred the New Synthesis, combin- 
ing population genetics with ecology, systematics, 
and biogeography to explain and explore many 
questions in evolution. 

In the past 40 years, molecular biologists 
have elucidated how genes regulate biological 
processes, but only the most basic mechanistic 
observations have been integrated into evolu- 
tionary biology. For example, evolutionary theory 
has effectively absorbed the distinction between 
coding (nonsynonymous) and silent (synonymous) 
substitutions in protein-coding regions, but other 
aspects of molecular biology currently contrib- 
ute little to evolutionary thought. The time has 
now come to integrate the specifics of molecular 
and developmental biology into evolutionary 
biology. Over the past 15 years, many examples 
of the genes and mutations causing evolution- 
ary change have been identified (/). Patterns in 
these data suggest that a synthesis of molecular 
developmental biology with evolutionary the- 
ory will reveal new general principles of genetic 
evolution. 


Nonrandom Distribution of Evolutionarily 
Relevant Mutations 


Recent studies suggest that the mutations con- 
tributing to phenotypic variation [evolutionarily 
relevant mutations (2)] are not distributed ran- 
domly across all genetic regions. The most com- 
pelling evidence comes from cases of parallel 
genetic evolution: the independent evolution of 
similar phenotypic changes in different species 
due to changes in homologous genes or some- 
times in the same amino acid position of homol- 
ogous genes. 

Many cases of parallel evolution have been 
discovered across all of the kingdoms. At least 
20 separate populations of the plant Arabidopsis 
thaliana have evolved null coding mutations (mu- 
tations that completely eliminate protein func- 
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tion) in the Frigida gene that cause early flowering 
(3). Resistance to DDT and pyrethroids has evolved 
in 11 insect species by mutations in either amino 
acid Leu'©'* or Thr”? of the voltage-gated sodium 
channel gene para (4). Two virus populations 
independently subjected to experimental evolu- 
tion in a novel host accumulated many of the 
same amino acid mutations (5). In total, about 
350 evolutionarily relevant mutations have been 
found in plants and animals, and more than half 
of these represent cases of parallel genetic evo- 
lution (/). 

One explanation for parallel genetic evo- 
lution is that most genes play specialized roles 
during development, and only some genes can 
evolve to generate particular phenotypic var- 
iants. For example, mutations in rhodopsin can 
alter light-wavelength sensitivity (6), and muta- 
tions in /ysozyme may enhance enzyme activity 
at the particular pH of a fermenting gut (7). But 
the reverse would not be true. Mutations in rho- 
dopsin are unlikely to enhance fermentation, and 
mutations in a digestive enzyme will not aid 
detection of a particular wavelength of light, 
even if each protein was expressed in the recip- 
rocal organ. 

Gene function explains part but not all of the 
observed pattern of parallel genetic evolution. In 
several cases, parallelism has been observed even 
though mutations in a large number of genes can 
produce similar phenotypic changes. For exam- 
ple, although more than 80 genes regulate flow- 
ering time (8), changes in only a subset of these 
genes have produced evolutionary changes in 
flowering time (3). Hundreds of genes regulate 
the pattern of fine epidermal projections, called 
trichomes, on Drosophila melanogaster larvae. 
But only one gene, called shavenbaby, has evolved 
to alter larval trichome patterns between Dro- 
sophila species, and this gene has accumulated 
multiple evolutionarily relevant mutations (9). 
What is special about these hotspot genes? 

Developmental biology illuminates why hot- 
spot genes such as shavenbaby exist. During 
development, multiple cell-signaling pathways 
and transcription factors act together to progres- 
sively divide the embryo into a virtual map that 
specifies when and where organs will form. The 
interactions between the genes encoding these 
signaling molecules and transcription factors can 
be represented as a genetic network. Gene inter- 
actions are modulated in large part by the cis- 
regulatory regions of patterning genes. (All genes 
are composed of two fundamentally different re- 
gions: a region encoding the gene product—a 
protein or an RNA—and adjacent cis-regulatory 
DNA that encodes the instructions governing 
when and where the gene product will be 
produced.) Transcription factors bind to cis- 
regulatory regions of target genes, and the summed 
effect of many such interactions at a target gene 
determines whether the gene is expressed or not. 
Patterning genes act within complex genetic net- 
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works, and usually each patterning gene contrib- 
utes to the development of multiple cell types. 
For example, most patterning genes that are active 
during embryonic development of the epider- 
mis contribute to the development of muscle- 
attachment sites, sensory organs, tracheal pits, 
trichomes, or other cell types. 

The importance of regulatory networks in 
determining which genes may be evolutionary 
hotspots can be illustrated with the genetic 
network that governs larval trichome develop- 
ment in D. melanogaster (Fig. 1). In this network, 
developmental patterning genes first collaborate 
to divide the embryonic epidermis into domains 
expressing distinct transcription factors. These 
patterning genes then regulate the expression 
of the shavenbaby gene, a so-called input-output 
gene (/0). Input-output genes integrate complex 
spatiotemporal information (the input) and trigger 
development of an entire program of cell dif- 
ferentiation (the output). The Shavenbaby pro- 
tein activates expression of a battery of target 
genes that transform an epidermal cell into a 
trichome cell. Each target gene triggers a spe- 
cific aspect of cell differentiation, and production 
of a differentiated trichome requires coordinated 
expression of all target genes. The pattern of 
trichomes over the body is thus determined by 
the distribution of Shavenbaby protein in the 
epidermis, which is controlled by the cis-regulatory 
region of the shavenbaby gene. The shavenbaby 
gene serves as a nexus for patterning information 
flowing in and for cell-fate information flow- 
ing out. 

In the entire regulatory network governing 
development of the Drosophila embryo, only 
shavenbaby, with its specialized function to 
rally the entire module of trichome morphogen- 
esis, can accumulate mutations that alter trichome 
patterns without disrupting other developmental 
processes. Genetic changes in upstream devel- 
opmental genes will alter trichome production, 
but these mutations also disrupt other organs. 
Changes in any one of the downstream genes are 
not sufficient to create or eliminate a trichome; 
concerted changes in multiple downstream genes 
are required to build a trichome (//). Further- 
more, all of the evolutionarily relevant mutations 
in shavenbaby that have been identified so far 
alter the cis-regulatory region and not the protein- 
coding region. Mutations in the protein-coding 
region would alter shavenbaby function in every 
cell that accumulates Shavenbaby protein, and 
this would alter every trichome produced in lar- 
vae and adults. Thus, a developmental perspec- 
tive clarifies why shavenbaby is a hotspot for 
evolutionarily relevant mutations and why these 
mutations occur in the cis-regulatory region of 
the gene. We predict that the cis-regulatory re- 
gions of other input-output genes may be hot- 
spots for other phenotypic characteristics. 

The shavenbaby gene provides one example 
of a more general principle: that mutations af- 
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Drosophila melanogaster 


Fig. 1. Morphological divergence between species has been caused by repeated 
evolution at an input-output gene. (A) D. melanogaster and D. sechellia differ in 
the pattern of fine trichomes decorating the dorsal and lateral surfaces of the 
larvae. This difference is caused entirely by evolution of the cis-regulatory region 
of the shavenbaby gene (9). (B) The cis-regulatory region of the shavenbaby 


fecting multiple phenotypic traits, so-called pleio- 
tropic mutations, are unlikely to contribute to 
adaptive evolution. As we discuss next, pleiotro- 
py and other genetic and population-genetic pa- 
rameters seem to influence the distribution of 
evolutionarily relevant mutations. 


The Factors Influencing the Distribution 
of Evolutionarily Relevant Mutations 


Pleiotropy. Some mutations generate specific 
phenotypic changes, whereas pleiotropic muta- 
tions alter several seemingly unrelated traits. 
Two mutations that cause evolutionary increases 
in the number of thoracic bristles in Drosophila 
illustrate the difference between mutations with 
specific and pleiotropic effects (Fig. 2). A cis- 
regulatory change in the scute gene affects the 
number of sensory organs only on the thorax 
(2), whereas a coding mutation in the poils au 
dos gene increases the number of sensory 
organs on both the thorax and the wing (/3). 
The poils au dos mutation is more pleiotropic 
than the scute mutation. Scute, like shavenbaby, 
is an input-output gene, whereas poils au dos is 
a patterning gene that, together with other 
pattering genes, regulates scute expression 
(Fig. 2). Mutations with pleiotropic effects will 
rarely change all phenotypic traits in a favorable 
way, and experimental evidence indicates that 
pleiotropic effects tend to reduce fitness (/4). 
Selection may favor extra bristles on the thorax, 
but not extrasensory organs on the wing. Even if 


Drosophila sechellia 


Hox 
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one effect of a pleiotropic mutation provides a 
major improvement in fitness, the other effects 
may be deleterious and will reduce the likeli- 
hood that the mutation will become established 
in the population (/5). 

Epistasis. When examined in a single genetic 
background, a mutation may have a specific 
or a pleiotropic effect. But in another genetic 
background, the same mutation may produce a 
different phenotypic effect because of nonaddi- 
tive interactions of alleles: so-called epistasis. 
For example, one allele in A. thaliana increases 
growth in one genetic background but reduces 
growth in a different genetic background (/6). 
The second genetic background is not simply 
deleterious in general because a variant allele 
at a second locus causes higher growth in this 
background. Thus, the effects of one mutation 
can depend on the genetic variation present at 
other loci. 

Epistasis is extremely common in natural 
populations and it may sometimes reduce the rate 
of evolution (/7). Epistasis increases the pheno- 
typic variance associated with a particular mu- 
tation, causing a mutation to have a fluctuating 
fitness effect dependent on the genetic back- 
ground. Thus, in an Arabidopsis population con- 
taining multiple genetic backgrounds, we expect 
that selection for increased size will tend to favor 
nonepistatic alleles that increase growth in all 
backgrounds rather than epistatic alleles that 
increase growth in only one genetic background. 
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gene integrates extensive information from developmental patterning genes to 
generate a pattern of Shavenbaby protein expression that prefigures the pattern 
of trichomes on the first-instar larva. Cells accumulating Shavenbaby will dif- 
ferentiate a trichome because Shavenbaby protein regulates a large battery of 
genes that act together to transform an epithelial cell into a trichome (21). 


Plasticity. Populations exposed to repeated 
environmental changes may evolve genetic 
mechanisms that produce different phenotypes 
suited to different environmental conditions: so- 
called phenotypic plasticity. For example, aphids 
can produce multiple phenotypic forms in re- 
sponse to environmental conditions, including 
asexual forms that reproduce quickly and 
sexual forms that lay overwintering eggs. Mu- 
tations that eliminate sexual forms—that reduce 
plasticity—may provide a lineage with a short- 
term advantage, a much faster reproductive rate. 
But in the long term, aphid lineages that do not 
produce sexual forms tend to go extinct, perhaps 
because they fail to adapt to changing environ- 
mental conditions. 

Similarly, in A. thaliana the Frigida gene 
controls plasticity for flowering time. Frigida 
responds to cold temperatures to induce flow- 
ering. In regions with warm winters, null Frigida 
mutations may provide a short-term advantage 
by consistently triggering flowering, even in the 
absence of a cold winter. But these mutations 
eliminate plasticity for flowering time, possibly 
preventing these plants from adapting to colder 
temperatures or from recolonizing areas in colder 
climates. Thus, the abundance of null Frigida 
mutations in Arabidopsis populations must result 
from factors that override the negative conse- 
quences of reduced plasticity. 

Strength of selection. When an environmental 
change favors a phenotype that is vastly different 
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from the mean phenotype in a pop- 
ulation, mutations causing large phe- 
notypic changes toward the new 
optimum will be favored, at least 
initially (78). For example, recently 
domesticated races have probably 
experienced strong selection by farm- 
ers, and many recent domestication 
traits result from mutations that cause 
large phenotypic effects, including 
pleiotropic deleterious effects. As an 
example, six different null-coding 
mutations in the myostatin gene cause 
muscle hypertrophy in different 
breeds of cattle (19). Myostatin is a 
member of the transforming growth 
factor-B superfamily of growth factors 
and acts as a negative regulator of 
muscle development. Although null 
mutations of myostatin generate cat- 
tle with more and leaner meat, these 
cattle experience difficulties in calv- 
ing and have reduced stress tolerance. 
Strong selection during domestication 
can obviously overcome the negative 
pleiotropic effects of null myostatin 
mutations. 

Population history. The past and 
current sizes of a population also in- 
fluence genetic evolution. Small pop- 
ulation size increases the effects of 
random sampling of alleles, so-called 
genetic drift. In small populations, 
genetic drift will allow deleterious 
alleles to occasionally increase in fre- 
quency. For example, a small inbred 
population of Bedouins in Israel has 
evolved a high frequency of a reces- 
sive allele that causes deafness (20). 
With stronger genetic drift in small 
populations, natural selection will fail 
to promote the spread of adaptive mu- 
tations of small effect. Instead, in 
comparison with large populations, 
adaptive mutations of relatively large 
effect will tend to evolve by natural 
selection in small populations. 

Small populations also have an- 
other critical effect on evolution: They 
limit the total number of new mu- 
tations introduced into the population 
each generation. Thus, small popula- 
tions may end up selecting far-from- 
ideal mutations (those with pleiotropic 
consequences and epistatic effects) 
simply because potentially superior 
mutations occur at a lower rate. 

The abundance of null Frigida mu- 
tations in populations of A. thaliana 
highlights the importance of popu- 
lation history in genetic evolution. 
Null Frigida mutations have the 
negative consequence of reducing 
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Fig. 2. Bristle patterns on the dorsal thorax of Drosophila species have evolved within species and between 
species because of different kinds of mutations. (A) A mutation generating a null allele of the poils au dos gene 
within a population of D. melanogaster increases the number of large bristles on the dorsal thorax (white 
triangles indicate normal bristles and green triangles indicate extra bristles) (23). In contrast, the increased 
number of bristles in D. quadrilineata results at least in part from changes in the cis-regulatory region of the scute 
gene (12). The extra bristles caused by the poils au dos mutation are not as precisely positioned as the extra bristles 
caused by the scute mutation (indicated by purple triangles). (B) The two evolving genes, poils au dos and scute, 
occupy different locations in the genetic network that generates the pattern of bristles. The scute gene is an input- 
output gene, whereas the poils au dos gene is a developmental patterning gene. The null mutation in poils au dos 
increases sensory organ numbers not only in the thorax but also in the wing. 
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plasticity for flowering time. These mutations 
also have pleiotropic effects [they reduce fruit 
production (2/)] and display epistasis with 
respect to other genes that control flowering 
time (22). These observations suggest that null 
Frigida mutations are not ideal alleles for con- 
trolling flowering time. In fact, null Frigida mu- 
tations must only rarely, if ever, be involved in 
phenotypic divergence between species because 
homologs of the Frigida gene exist in diverse 
plant species. But natural selection has over- 
come the deleterious effects of null Frigida mu- 
tations to promote the spread of these mutations 
in small populations. A. thaliana has migrated 
from Scandinavia around the world 

in the footsteps of agriculture. Sub- 

populations have adapted to local 

conditions, including the relatively 

warm and short winters of more 


temperate regions. A. thaliana plants 1.00 
are self-fertile, so even a single plant o 
can give rise to a new population. 6 
: gj (0.75 
These small subpopulations pro- s 
vide fewer opportunities for bene- € 
ficial mutations of specific effect Ke) 0.50 
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: 3 = 
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of null Frigida mutations probably 
reflects the fact that these mutations 
occur at a higher rate than muta- 
tions without associated deleterious 
consequences. 


Fig. 3. Different kinds of mutations occur with different frequency during 
short-term and long-term evolution. Among all mutations causing mor- 
phological variation identified to date, the proportion of cis-regulatory 
mutations (black bars) is higher for long-term evolution than for short-term 
evolution. For all mutations that have been reported to cause phenotypic 
variation in either morphology or physiology, the proportion caused by null 
coding mutations (red bars) is higher for short-term evolution than for long- 
term evolution. The numbers above the bars refer to the total number of 
examples in each category. The number of cases of morphological evolution 
(black bars) is a subset of the number of cases of phenotypic evolution (red 
bars). Data are from (1). 


The Genetic Basis of Short-Term 
and Long-Term Evolution 


The Frigida example is not unique. 
In many plants and animals, evolu- 
tion over long periods (variation 
between species) appears to differ 
in several ways from evolution 
over shorter periods (variation be- 
tween domesticated races and be- 
tween individuals within a species) 
(/). Here are three general ways in 
which long-term and short-term genetic evo- 
lution differ. 

First, epistasis is commonly found for the 
mutations that contribute to phenotypic vari- 
ation within species, whereas it is rarely ob- 
served for the mutations that cause differences 
between species. Within D. melanogaster, var- 
iation in bristle number is caused by multiple 
loci of relatively small effect, and these loci 
have epistatic effects of the same order of 
magnitude as the additive effects (23, 24). In 
contrast, morphological differences between 
Drosophila species result from multiple loci 
of intermediate-to-small effect that only rarely 
show epistasis (25, 26). Studies of body size 
variation in chickens show a similar pattern, 


with alleles segregating within species show- 
ing more epistasis than alleles differentiating 
species (27, 28). 

Second, null mutations, which arise frequent- 
ly and often cause pleiotropic and epistatic 
effects, seem to contribute more to phenotypic 
variation within species than to phenotypic dif- 
ferences between species. About 55% of the 
99 mutations known to cause domestication 
traits are null-coding mutations, whereas only 
7% of the 75 mutations known to cause inter- 
specific differences are null-coding mutations 
(Fig. 3). For example, although domesticated 
cattle stocks have evolved multiple null muta- 
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tions of the myostatin gene, all mammal species 
investigated so far possess a functional myostatin 
gene. 

Third, the frequency of cis-regulatory mu- 
tations causing morphological variation differs 
between taxonomic levels. Morphological changes 
may occur either through coding changes or 
through cis-regulatory changes (Fig. 2). Be- 
cause mutations in cis-regulatory regions often 
have fewer pleiotropic effects than mutations in 
coding regions, morphological changes are 
expected to involve mainly cis-regulatory muta- 
tions (1, 2, 29). Within species, most mutations 
that cause morphological variation have been 
found in protein-coding regions (Fig. 3). In 
contrast, between species most mutations that 
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cause morphological differences have been found 
in cis-regulatory regions. Presumably, many of 
the coding mutations found within species fail to 
spread through populations, perhaps because of 
pleiotropic deleterious effects. 

These striking and unexpected differences 
between short-term and long-term genetic evolu- 
tion have emerged only recently with the accu- 
mulation of a sufficient number of case studies. 
These patterns are consistent with theoretical ex- 
pectations of how the five parameters discussed 
earlier (pleiotropy, epistasis, plasticity, strength 
of selection, and population structure) should in- 
fluence genetic evolution. Evolution over long 
periods, reflected in the differences 
between species, should result from 
mutations relatively devoid of plei- 
otropic and epistatic effects. In con- 
trast, evolution over shorter periods, 
reflected in the differences between 
domesticated races and in the var- 
iation segregating within species, 
may often result from mutations that 
disrupt plasticity or that have pleio- 
tropic and epistatic effects. In sum- 
mary, differences between species 
are caused by a biased subset of the 
mutations that have appeared within 
22 natural populations (/). 


Conclusions 


Although mutations are thought to 
occur randomly in the genome, the 
distribution of mutations that cause 
biological diversity appears to be 
highly nonrandom. Gene function, 
gene structure, and the roles of 
genes and gene products in genetic 
networks all influence whether par- 
ticular mutations will contribute to 
phenotypic evolution. Thus, for some 
phenotypic changes, evolutionari- 
ly relevant mutations are expected 
to accumulate in a few hot-spot 
genes and even in particular regions 
within single genes. In addition, 
population biology and ecology in- 
fluence the spectrum of evolutionarily relevant 
mutations. Over short periods, adaptive muta- 
tions with deleterious pleiotropic effects may 
be selected because mutations without delete- 
rious effects have not yet appeared. In contrast, 
over long periods adaptive mutations without 
pleiotropic deleterious effects have more oppor- 
tunity to arise and be selected. 

The genetic basis of phenotypic evolution 
thus appears to be somewhat predictable. These 
emerging patterns in the distribution of mutations 
causing phenotypic diversity derive, however, from 
a limited set of data culled from the published 
literature. It is possible that these patterns re- 
flect biases in the way scientists have searched 
for evolutionarily relevant mutations (/). For 
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example, many researchers focus on candidate 
genes, which precludes the discovery of pre- 
viously unknown genes. In the future, we expect 
that widespread adoption of unbiased experi- 
mental approaches, for example genetic map- 
ping, will provide data for robust tests of the 
predictability of genetic evolution. Genetic map- 
ping can be performed within species and, in 
rare cases, between closely related species. Al- 
ternatively, gene-by-gene replacement of all genes 
from one species into a second species, although 
experimentally tedious, may allow unbiased sur- 
veys for species that cannot be crossed. This 
approach would allow comparisons of distantly 
related taxa and provide a direct test of whether 
distantly related taxa have accumulated differ- 
ent kinds of evolutionarily relevant mutations 
than have closely related species. 

More precise quantitative predictions about 
the mutations responsible for phenotypic evolu- 
tion will probably result from further synthesis 
of molecular biology and population genetics. 
New theoretical models will encompass multi- 
ple population-genetic parameters within a ge- 
nomic and developmental framework. These 
models may provide insight into how the dis- 
tribution of spontaneously arising mutations is 
translated into the distribution of mutations seg- 


regating within populations and how these two 
distributions impact short-term and long-term 
evolution. 

Finally, the fact that long-term genetic evo- 
lution may represent a biased subset of mutations 
has applied consequences, from the development 
of more efficient computer algorithms that utilize 
evolutionary search strategies to the improve- 
ment of agricultural crops and animals. Domes- 
tication often selects for mutations that have 
pleiotropic deleterious effects. Long-term evolu- 
tion, in contrast, selects for mutations with specific 
phenotypic effects, and this class of mutations 
might be exploited to engineer domesticated 
races that possess desirable characteristics with- 
out associated unfavorable properties. 
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The Sea-Level Fingerprint of 
West Antarctic Collapse 


Jerry X. Mitrovica,’ Natalya Gomez,’ Peter U. Clark? 


here is widespread concern that the West 

Antarctic Ice Sheet (WAIS), which is char- 

acterized by extensive marine-based sec- 
tors (/), may be prone to collapse in a warming 
world. The recent Fourth Assessment Report of 
the Intergovernmental Panel on Climate Change 
(IPCC) (2) estimated that this collapse would lead 
to a sea-level rise of ~S m. This estimate 
is derived by converting the total volume 
of all grounded portions of WAIS into 
water, filling in any topographic holes 
associated with marine-based sectors, 
and spreading the remaining water uni- 
formly (i.e., eustatically) across the oceans 
(3). Although the appropriate effective 
eustatic value (EEV) for WAIS collapse 
is uncertain (e.g., will non—marine-based 
sectors vanish?), we show that, what- 
ever the value, sea-level changes at 
some coastal sites will be significantly 
higher (or, less commonly, lower) than 
the EEV. 

The rapid melting of ice sheets and 
glaciers leads to a sea-level change that 
departs dramatically from the assump- 
tion of a uniform redistribution of melt- 
water (4). An ice sheet exerts a gravitational 
attraction on the nearby ocean and thus 
draws water toward it. If the ice sheet 
melts, this attraction will be reduced, and 
water will migrate away from the ice sheet. 
The net effect, despite the increase in the 
total volume of the oceans after a melting 
event, is that sea level will actually fall 
within ~2000 km of the collapsing ice 
sheet and progressively increase as one 
moves further from this region. Each ice 
reservoir will produce a distinct geom- 
etry, or fingerprint, of sea-level change. 

Although the physics of fingerprint- 
ing has been embraced in studies of past 
sea-level change, it has been largely ig- 
nored in discussions of future projec- 
tions. A widely neglected exception is 
an analysis by Clark and Lingle (5), who 
were concerned with sea-level changes 
after a “uniform thinning” of the WAIS. 
Their fingerprint calculation had its basis 
in a standard sea-level theory that ac- 
counts for load self-attraction (as de- 
scribed above) and the associated elastic 
deformation of the solid Earth. Their re- 
sults, reproduced in Fig. 1A, show a 
zone of sea-level fall close to the WAIS 
and a maximum rise in the North Pacific. 


Coastal sites well away from the WAIS have peak 
values ~5 to 10% higher than the EEV. 

The sea-level theory adopted by Clark and Lingle 
does not allow for shoreline migration, including the 
inundation and adjustment of regions vacated by 
grounded, marine-based ice cover, or any feedback 
onto sea level of Earth rotation changes. We show 


Fig. 1. Sea-level change in response to the collapse of the 
WAIS computed by using (A) a standard sea-level theory (5), 
which assumes a nonrotating Earth, no marine-based ice, and 
shorelines that remain fixed to the present-day geometry with 
time, as well as (B) a prediction based on a theory (6) that 
overcomes these limitations. Both predictions are normalized by 
the EEV associated with the ice collapse. In (B), the total volume 
of the WAIS is used in the calculation, whereas in (A) only an 
amount of ice with a volume that matches the EEV is removed 
(because the latter cannot take into account the inundation of 
marine-based sectors). (C) The difference between predictions 
generated by using the two sea-level theories [(B) minus (A)]. 


DyD 

_) 1 \ 

(Fig. 1B) a projection based on a sea-level theory 
(6) that overcomes these limitations. These results 
show a highly accentuated sea-level rise in the oceans 
bordering North America and in the Indian Ocean. 
Coastal sites in North America would experience a 
rise ~30% higher than the EEV. 

The difference between this and the standard 
(Clark and Lingle) calculations is shown in Fig. 1C. 
The far-field geometry of the differential sea-level 
signal, which includes a roughly uniform rise in com- 
bination with a quadrantial form, is diagnostic of the 
physical mechanisms responsible for the accentuated 
signal (/). In particular, the uniform rise is due to the 
expulsion of water from the West Antarctic as flooded, 
marine-based sectors of this region uplift (elastically) 
in response to the unloading (fig. S1B). The dominant 
quadrantial signal arises from a feedback associated 
with Earth rotation (7). In particular, the collapse of the 
WAIS leads to a displacement of the south rotation 
pole of ~100 m x EEV toward the West Antarctic; 
this shift drives a sea-level rise in North America and 
the Indian Ocean and a fall over South America and 
Asia relative to the EEV (fig. S1, A and C). 

These results reinforce serious concerns about the 
impact on some coastal communities of a future insta- 
bility in the WAIS. Consider Washington, DC, and 
the case where we adopt the conventional value of 5 m 
for the EEV. We predict a sea-level rise 1.3 m higher 
than the EEV (or 6.3 m total) at this site, an increase 
above the EEV that is three times greater than pre- 
dicted using the standard sea-level theory (Fig. 1A). 
Any robust assessment of the sea-level hazard asso- 
ciated with the loss of major ice reservoirs must, of 
course, account for other potential sources of melt- 
water, namely Greenland, the East Antarctic, and 
mountain glaciers (8). Nevertheless, future projec- 
tions should avoid simple, eustatic estimates and 
be based on a suitably complete sea-level theory. 
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The Formation of Massive Star 
Systems by Accretion 


Mark R. Krumholz,** Richard I. Klein,2? Christopher F. McKee,7"4 


Stella S. R. Offner,* Andrew J. Cunningham? 


Massive stars produce so much light that the radiation pressure they exert on the gas and 

dust around them is stronger than their gravitational attraction, a condition that has long been 
expected to prevent them from growing by accretion. We present three-dimensional radiation- 
hydrodynamic simulations of the collapse of a massive prestellar core and find that radiation 
pressure does not halt accretion. Instead, gravitational and Rayleigh-Taylor instabilities channel 
gas onto the star system through nonaxisymmetric disks and filaments that self-shield against 
radiation while allowing radiation to escape through optically thin bubbles. Gravitational 
instabilities cause the disk to fragment and form a massive companion to the primary star. 
Radiation pressure does not limit stellar masses, but the instabilities that allow accretion 


to continue lead to small multiple systems. 


120 times that of the Sun (/, 2), but the 

mechanism by which the most massive 
stars form is a long-standing mystery. Stars greater 
than ~20 solar masses (Mj) have Kelvin times 
(the time required for a star to radiate away its 
gravitational binding energy) that are shorter than 
their formation times, and as a result they attain 
their full luminosities while still accreting from 
their natal clouds. As the radiation from such an 
embedded, massive star diffuses outward through 
the dusty gas in the protostellar envelope, it exerts 
a force that opposes gravity. Spherically averaged, 
the ratio of the radiative and gravitational forces is 
7.7 x 10° Ko (L/M)o, where Ko is the specific 
opacity of the gas in cm’/g and (L/M)) is the stel- 
lar light-to-mass ratio in units of Lo/M. (where 
Lo 1s the luminosity of the Sun). Because the 
dusty envelopes of massive protostars have Ky ~ 
5, the ratio of radiative to gravitational force 
exceeds unity for all stars with (L/M)p = 2500. 
Main-sequence stars reach this value of (L/M)o at 
masses of ~20 Mo. Therefore, radiation is ex- 
pected to halt spherically symmetric infall (3, 4) 
near this mass. Two-dimensional simulations of 
massive star formation, which include rotation 
and thus an accretion disk that partially shields the 
gas from radiation (5, 6), find that radiation 
completely halts accretion once stars reach ~40 
Mo (7); this is inconsistent with the highest 
known stellar masses. 

Here, we report a three-dimensional simula- 
tion of the formation of stars with masses greater 
than 20 Mg that includes the effects of radiation 
pressure. Three-dimensionality is important be- 


S tars can form with masses up to at least 
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cause instabilities that determine the interaction 
of gas and radiation, such as Rayleigh-Taylor in- 
stabilities, show faster growth rates and higher 
saturation amplitudes in three dimensions (8). 
Additionally, instabilities in accretion disks (9, /0) 
and the resulting formation of companion stars (//) 
can only be simulated when the disk is represented 
nonaxisymmetrically. It is important to consider 
these effects because most massive stars are mem- 
bers of multiple systems (/2, /3). 

Our initial conditions consisted of a gas cloud 
with mass = 100 Mo, radius = 0.1 pc, and density 
profile p < r |, consistent with models (14, 15) 
and observations (/6) of the initial states of mas- 
sive prestellar cores. Its initial temperature was 
20 K and it was in slow, solid-body rotation at a 
rate such that the ratio of rotational kinetic energy 
to gravitational binding energy was 0.02, which 
is consistent with the rotation rates seen in lower- 
mass cores (/7). Previous two-dimensional simu- 
lations suggest that varying these parameters within 
the observed range of massive core properties 
would not alter the qualitative behavior (7). Even 
though observed massive cores have large turbu- 
lent velocities (16), we did not include turbulence 
in our initial conditions so as to focus on the ef- 
fects of radiation pressure. Simulations that include 
the effects of turbulence (/0) do not appear to 
produce qualitatively different results. 

We evolved this initial state by means of our 
adaptive mesh refinement code ORION (/8—22), 
which solves the equations of gravito-radiation- 
hydrodynamics in the gray, flux-limited diffusion 
approximation. The code dynamically increases 
resolution as needed down to a minimum cell 
size of 10 AU; regions that collapse to densities 
above the Jeans density at the maximum resolu- 
tion become star particles, each of which produces 
a luminosity determined by a protostellar evolu- 
tion model (23). 

The simulation passed through several distinct 
phases (Fig. 1, figs. S1 to S3, and movie S1), 
forming multiple stars (Fig. 2). The cloud began 


to collapse immediately and a central protostar 
formed 3600 years afterward. For the next 17,000 
years, the protostar accreted smoothly via an axi- 
symmetric disk (Fig. 1A). During this phase, the 
mass of the star grew to 11 Mo and its luminosity 
remained below ~10* Lg. Because (L/M)y < 1000, 
radiation pressure produced no noticeable effects. 
After ~20,000 years, the disk became gravitation- 
ally unstable and developed a pronounced two- 
armed spiral that transported angular momentum 
efficiently (Fig. 1B) (24). Accretion onto the pro- 
tostar continued smoothly. 

Accretion, unimpeded by radiation pressure, 
continued until 25,000 to 26,000 years, when the 
star reached a mass of roughly 17 Mo and achieved 
a sustained (L/M)o value of ~2500, driven by Kelvin- 
Helmholtz contraction. This was luminous enough 
for its radiation pressure force to exceed the grav- 
itational force, and the star began to drive gas 
outward around the polar axis, inflating radiation- 
filled bubbles both above and below the accretion 
disk (Fig. 1C). The density inside the bubbles was 
very low, and within them the radiation pressure 
exceeded the gas pressure by orders of magnitude. 
Almost all gas falling onto the protostar struck the 
walls of the bubbles, where it was shocked and 
swept up into the bubble walls. However, this did 
not slow accretion in our simulation, because the 
gas that struck the bubble walls eventually trav- 
eled along the margin until it reached the disk, at 
which point it continued to accrete onto the star. 
During this phase, radiation forces acting on ma- 
terial accreting onto the disk and gravitational 
forces acting on material in the disk caused it to 
become increasingly nonaxisymmetric. A series 
of small secondary stars formed in the disk, most 
of which advected inward because of dynamical 
friction with the gas and collided with the central 
protostar. As a result, the accretion rate onto the 
central star became variable, but its mean value 
remained roughly unchanged. 

Around 35,000 years, a series of disk-borne 
stars collided and became massive enough to re- 
sist being dragged inward (Fig. 1D). The second- 
ary star was initially smaller than the primary and 
orbited it, intercepting and accreting much of the 
inflowing gas (25). As a result, the secondary star 
acquired its own disk and grew faster at first than 
the primary, and the ratio of the mass of the 
secondary to that of the primary reached a value 
of >0.5. Thereafter, the disk continued to frag- 
ment but at a much-diminished rate, and accre- 
tion became almost evenly divided between the 
two massive stars. A third small disk-borne star 
was ejected into a wide orbit in our simulation, 
but eventually fell back and was captured and 
accreted. The total accretion rate onto the binary 
system varied periodically as the stars orbited one 
another, but its time-averaged value remained about 
the same as before binary formation. From this 
point onward, the bubbles showed instability and 
constantly changed shape while undergoing slow 
overall expansion. Accretion onto the system con- 
tinued uninterrupted. 
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We halted the simulation at 57,000 years, after 
a ~20,000-year period when there was no further 
qualitative change in the evolution (Fig. 1E). At 
this point the system was a binary with a total 
mass of 70.7 Mo and a time-averaged total lumi- 
nosity of ~5 x 10° Lo. The two stars had masses 
of 41.5 Mo and 29.2 Mo and were 1590 AU 
apart. Neglecting the effects of the gas, the 
semimajor axis of the orbit was 1280 AU 
(eccentricity 0.25), but because this neglects the 
gas, it may be an overestimate. Orbits like this are 
typical of young O stars, at least 40% of which 
are visual binaries with separations of ~1000 AU 
(12). These are not the final system parameters, 
because the envelope and the disk still contained 
28.3 Mo of gas and the accretion rate had not 
diminished. However, the qualitative nature of 
the final system was well established. 

We compared our result to two-dimensional 
simulations. The largest star that formed in any 
two-dimensional simulation with gray radiative 
transfer had a mass of 22.9 Mo. If the simulation 
included a multifrequency treatment of the radia- 
tion, which we omitted because of its computa- 
tional cost (23), the maximum mass of the star that 
formed was 42.9 M, (7). In these two-dimensional 
simulations, the initial phases of collapse, disk 
formation, and growth of a polar bubble were 
quite similar to ours, although the disk lacked 
nonaxisymmetric structure. In both cases there 
was a “flashlight effect” (7, 26) in which the disk 
beamed radiation preferentially in the polar 
direction. In two dimensions, however, as the 
star’s mass grew, radiation halted accretion over 
an ever larger fraction of the solid angle around 
the star. This eventually stopped infall onto the 
disk. Some of the gas remaining in the disk 
continued to accrete onto the star, but at a 
diminishing rate, and eventually the disk density 
became low enough for stellar radiation to blow 
it away. 

This never happened in our simulation. Instead, 
when the luminosity became large enough that our 
bubbles no longer delivered mass to the disk ef- 
ficiently, they became asymmetric and clumpy. In 
some places radiation blew out sections of the 
bubble wall, whereas in others dense filaments of 
gas fell toward the stars (Fig. 3). The structure of 
dense fingers of heavy, downward-moving fluid 
alternating with chimneys of outgoing radiation 
is analogous to that of a classical Rayleigh-Taylor 
instability, with radiation taking the place of the 
light fluid. Radiation forces away from the star 
are stronger than gravity when averaged over 47 
st, producing velocities and net forces that have 
an outward direction over most of the solid angle. 
Much of the mass is concentrated into the dense 
fingers, and because radiation flows around rather 
than through these structures, within them the 
velocity and the net force have an inward direc- 
tion. However, this did not remove the angular 
momentum of the gas, so it continued to fall onto 
the disk rather than directly onto the stars. The 
growth of clumps in the disk that form secondary 
stars is a natural side effect of this process, but 


radiation may be just accelerating a process that 
is caused by gravity. At least 40% of the accreting 
gas reached the disk through this Rayleigh-Taylor 
mechanism; gas falling onto the outer disk di- 
rectly accounted for ~25% of the accretion, and 
gas reaching the disk by traveling along the 


Fig. 1. Snapshots of the sim- 
ulation at (A) 17,500 years, 
(B) 25,000 years, (C) 34,000 
years, (D) 41,700 years, and 
(E) 55,900 years. In each 
panel, the left image shows 
column density perpendic- 
ular to the rotation axis in a 
(3000 AU) region; the right 
image shows volume density 
in a (3000 AU)? slice along 
the rotation axis. The color 
scales are logarithmic (black 
at the minimum, red at the 
maximum), from 10° to 
107° g cm on the left and 
10-* to 10° g cm? on the 
right. Plus signs indicate the 
projected positions of stars. 
See figs. S1 to S3 and movie 
S1 for additional images. 


REPORTS 


bubbles’ outer walls contributed the remaining 
~35% (23). 

Continued disk feeding is what made the 
three-dimensional results different from earlier 
two-dimensional ones. At 34,000 and 41,700 
years (Fig. 1, C and D), bracketing the onset of 
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instability, the total stellar mass was 32.4 Mj and 
46.9 Mo, respectively, and the disk mass was 4.0 
to 5.7 Mo and 4.5 to 9.1 Mo, respectively. (The 


Fig. 2. (A) Stellar mass, (B) 
stellar luminosity, and (C) ac- 
cretion rate as a function of 
time. Black lines show values 
summed over all stars, blue 
lines show values for the most 
massive star, and red lines 
show values for the second 
most massive star. In (A), 
asterisks mark the onset of 
deuterium burning and the 
diamond marks hydrogen 
burning. In (B), solid lines 
show luminosities from all 
sources (accretion, Kelvin- 
Helmholtz contraction, and 
nuclear burning); dashed lines 
show accretion luminosity 
only. Luminosities and ac- 
cretion rates are 100-year 
running averages. 
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halted, then the evolution would likely have been 
the same as in the two-dimensional simulations. 
The remnant disk would still have accreted, but 
its low mass means that the stars would have 
gained less than 10 Mo from it. In our three- 
dimensional simulation they instead accreted 25 
to 40 Mo, and did so at a constant rather than a 
declining accretion rate. Accretion of stars rather 
than gas did not contribute appreciably to this. The 
star with a final mass of ~40 Mo gained only 1.8 Mo 
via collisions, whereas the ~30 Mo star gained 
1.2 Mo, excluding the initial collision that created it. 

The final star formation efficiency for our 
simulated core was at least 70%, and a majority 
of the mass accreted within one mean-density free- 
fall time of the initial core (52,500 years). Because 
infall was continuing at a roughly constant rate 
and no further qualitative changes were occurring 
at the time we halted the simulation, it is likely 
that much of the remaining mass would accrete 
onto the star system. In reality, protostellar out- 
flows, which we have not included in our simu- 
lation, would limit the star formation efficiency to 
~50% (27, 28). Our result indicates that, relative 
to outflows, radiation pressure does not affect star 
formation efficiency or time scale. The cavities 
generated by outflows would reduce the effects 
of radiation pressure even further (29) and would 
modify the geometry of the radiation pressure 
bubbles or would prevent their formation alto- 
gether. Photon bubble instabilities, which can oc- 
cur if the gas is sufficiently magnetized (30), might 
also reduce the effects of radiation pressure and 
modify the bubble geometry. Our simulation shows 
that even if these effects are omitted, radiation 
pressure does not present a barrier to massive star 
formation. 


4 
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Fig. 3. Snapshot of a (6000 AU)’ stice along the rotation axis at 51,100 years. Color 
indicates density from 10-2° to 10“ g cm? on a logarithmic scale as in Fig. 1. Plus 
signs show projected stellar positions. (A) Arrows show gas velocity. (B) Arrow directions 


indicate the direction of the net (radiation plus gravitational) force; lengths are pro- 


portional to the magnitude of the net force divided by the magnitude of the gravi- 


tational force. Thus, an inward arrow of length 1 represents negligible radiation force. 
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Confined Crystallization of Polyethylene 
Oxide in Nanolayer Assemblies 


Haopeng Wang, Jong K. Keum,? Anne Hiltner,?* Eric Baer,? Benny Freeman, 


Artur Rozanski,? Andrzej Galeski? 


The design and fabrication of ultrathin polymer layers are of increasing importance because of 
the rapid development of nanoscience and nanotechnology. Confined, two-dimensional 
crystallization of polymers presents challenges and opportunities due to the long-chain, covalently 
bonded nature of the macromolecule. Using an innovative layer-multiplying coextrusion process to 
obtain assemblies with thousands of polymer nanolayers, we discovered a morphology that 
emerges as confined polyethylene oxide (PEO) layers are made progressively thinner. When the 
thickness is confined to 20 nanometers, the PEO crystallizes as single, high-aspect-ratio lamellae 
that resemble single crystals. Unexpectedly, the crystallization habit imparts two orders of 


magnitude reduction in the gas permeability. 


rystalline polymers, such as polyethylene, 

polypropylene, poly(ethylene terephthal- 

ate), and nylon, have been broadly used 
as gas-barrier films in food, medicine, and elec- 
tronics packaging, benefiting thereby from their 
low cost, easy processing, and mechanical tough- 
ness. Good barrier properties are imparted by the 
ability of polymer chains to crystallize into semi- 
crystalline materials with both crystalline and 
amorphous phases (/). The efficiency of chain 
packing is such that the crystalline phase is 
generally regarded as impermeable to even small 
gas molecules, and gas transport is seen as oc- 
curring through the amorphous regions (2). The 
processing conditions can be readily varied to 
control the amount of crystallinity and chain orien- 
tation and to tune the barrier properties of the final 
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product (3). With the growing use of polymers as 
thin and ultrathin films (4, 5), morphologies have 
been found resulting from constrained two- 
dimensional (2D) polymer crystallization (4, 6). 
These crystalline morphologies could possess gas 
permeability characteristics that are not expected 
from the bulk polymers. 

Despite confinement, crystallization of polymer 
chains follows the conventional habit whereby 
polymer chains fold back and forth into stems to 
form crystalline lamellae with a thickness of ~10 to 
20 nm. It is typical of crystallization from the 
isotropic melt that the lamellae are organized in a 
spherulitic morphology (7). However, the pro- 
cesses of nucleation and growth that control the 
crystallization kinetics can be profoundly affected 
by nanoscale confinement. The thickness of ultra- 
thin polymer layers, usually a few tens of nano- 
meters, is comparable to or a small multiple of the 
lamellar crystal thickness. Hence, the isotropic 
growth of lamellar crystals is greatly hampered, 
and crystallization under confinement can pro- 
duce a specific lamellar crystal orientation. Often, 
the preferred lamellar crystal orientation is vertical 
to the layer (edge-on) (8, 9, 1/0). However, at the 


other extreme, lamellar crystal orientation parallel 
to the layer (flat-on) is observed (6, //—/4). Al- 
though the mechanisms for the specific lamellar 
orientation during confined crystallization are still 
under investigation (/5, /6), it is believed that the 
confined crystals will show anisotropic properties. 

The 2D crystallization of polymers is con- 
ventionally studied with thin films or block co- 
polymers that contain a crystallizable block. In 
the former, crystallizable layers with nanometer 
to submicron thicknesses are prepared by a so- 
lution process such as spin-coating (4, 17) or 
Langmuir-Blodgett (8) techniques. This ap- 
proach is limited by the solvent requirement and 
by the small amount of material that can be 
fabricated. In the latter, a layered morphology on 
the nanometer scale is achieved as a consequence 
of microphase separation of the dissimilar blocks. 
If the crystallization temperature of the crystal- 
lizable block is below the order-disorder transi- 
tion temperature (/9), crystallization occurs with 
confinement in the layer-normal direction. A wide 
range of crystallizable blocks have been studied 
(8, 11-14, 16). However, elucidation of the 
structure-property relationships has been hindered 
by the need to synthesize the block copolymers and 
by the shear alignment that is required to construct 
the uniformly oriented phase structure (20). 

In contrast to the self-assembled confinement 
created with microphase-separated block copoly- 
mers (2/), layer-multiplying coextrusion uses 
forced assembly to create films with hundreds or 
thousands of alternating layers of two polymers 
(22, 23). Almost any melt-processable polymer 
can be fabricated into kilometers of nanolayered 
films, and layers less than 10 nm in thickness 
have been made. Although the amount of ma- 
terial in a single layer is very small, the properties 
of the confined layer are multiplied many-fold by 
the number of identical layers in the assembly. 
This enables us to use conventional methods to 
probe size-scale-dependent properties. Poly- 
ethylene oxide (PEO) was coextruded with poly 
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(ethylene-co-acrylic acid) (EAA), a copolymer 
with much lower crystallinity than PEO (fig. S1). 
Films with 33, 257, and 1025 alternating EAA 
and PEO layers were coextruded with various 
thicknesses and various composition ratios, in- 
cluding (EAA/PEO vol/vol) 50/50, 70/30, 80/20 
and 90/10 (table S1). The nominal PEO layer 
thickness, which was calculated from the number 
of layers, the composition ratio, and the film 
thickness, varied from 3.6 um to 8 nm. Non- 
layered control films were also extruded. 
Permeability to small molecules is an impor- 
tant performance property of polymer films. Gas 
transport also provides a probe into the solid-state 
structure (3). The oxygen permeability (P) was 
first measured on films where the PEO and EAA 
layers had the same thickness. The layer thick- 
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ness was varied by changing the number of co- 
extruded layers and the overall film thickness 
while maintaining the composition at 50/50. The 
results, plotted in Fig. 1A as a function of layer 
thickness, unexpectedly showed a steady decrease 
in the oxygen permeability as the EAA and PEO 
layers became thinner. A comparable reduction 
was found for carbon dioxide permeability. 

The series model for layered assemblies gives 


the gas permeability as 
-1 
o 1-9 
P| = ( ‘PEO ‘PEO (1) 
Pro Prsa 


where opgo is the volume fraction of PEO, and 
Ppgo and Pgaa are the permeabilities of PEO 
and EAA control films, respectively. Using the 
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Fig. 1. The effect of layer thickness on oxygen permeability. (A) Oxygen permeability of films with equal 
volume fractions of EAA and PEO. The dashed line indicates P, calculated from Eq. 1. (B) Oxygen permeability 
of the PEO layers from films of varying composition calculated from Eq. 2. The dashed line indicates Ppeo. The 
open symbol is for a film with PEO layer breakup. The solid lines are drawn to guide the eyes. 
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Fig. 2. AFM phase images of partial cross sections of the layered EAA/PEO 
films. The PEO layer has substantially higher crystallinity than the EAA layers 
and hence appears bright in the AFM images. (A) A low-resolution image of 
an EEA/PEO film with 50/50 composition, 33 alternating layers, and nominal 
PEO layer thickness of 3.6 um. (B) A higher-resolution image showing the 
spherulitic morphology of the 3.6-um-thick PEO layer. (C) A low-resolution 
image of an EAA/PEO film with 70/30 composition, 1025 alternating layers 
and nominal PEO layer thickness of 110 nm. (D) A higher-resolution image of 
the 110-nm-thick PEO layers showing the oriented stacks of PEO lamellae. (E) 
A high-resolution image of an EAA/PEO film with 90/10 composition, 1025 
alternating layers, and nominal PEO layer thickness of 20 nm showing that 
the PEO layers crystallized as single, extremely large lamellae. (F) A schematic 
showing the gas diffusion pathway through the layered assembly with 20-nm- 
thick PEO layers. The arrows identify the EAA layers and PEO layers. 
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determined values of 0.38 barrer (24) and 2.30 
barrer for Ppgo and Pega, Eq. 1 gave the oxygen 
permeability of an EAA/PEO 50/50 layered 
assembly as 0.65 barrer. Only the film with the 
thickest layers conformed to this prediction. 

Noting that PEO is substantially less perme- 
able to oxygen than EAA, Eq. 1 predicts that P, 
will be quite sensitive to Ppgo even if pgo is 
relatively small. To ascertain whether a dramatic 
change in the PEO permeability was responsible 
for the layer thickness effect, numerous films that 
varied in both the composition ratio and the layer 
thickness were tested. Although the measured 
values of P scattered, depending on the 
composition ratio, when an effective PEO perme- 
ability was extracted, 


1 1-pz0\ 7 
Prof = PEO ( P 7) (2) 


the data collapsed to a single curve when Ppro err 
was plotted as a function of the PEO layer thick- 
ness (Fig. 1B). Only the results for the thicker PEO 
microlayers conformed to Eq. 2, with Ppro err ~ 
Ppgo, as indicated by the dashed line. Deviation 
below the line was seen with 1-um-thick PEO 
layers. The lowest P prop Was achieved with 
20-nm layers, and the value of 0.0052 barrer was 
almost 2 orders of magnitude less than Ppgo. 

It seemed likely that crystallization in a con- 
fined space resulted in an unusual crystalline mor- 
phology that endowed the PEO nanolayers with 
exquisitely low permeability. However, differen- 
tial scanning calorimetry revealed that even in the 
thinnest layers, both PEO and EAA possessed 
the same melting enthalpy and the same melting 
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temperature as the control films, which were 153 
J/g and 66°C for PEO, and 98 J/g and 98°C for 
EAA, respectively (fig. S2 and table $2). Thus, 
any unusual crystalline morphology that pro- 
vided the very low permeability of PEO nano- 
layers was not accompanied by changes in the 
crystallinity or in the lamellar thickness. 

The layers were viewed by microtoming the film 
through the thickness and examining the exposed 
surface in atomic force microscope (AFM). A region 
from the cross section of a film with 3.6-1m-thick 
PEO layers confirmed the layer continuity (Fig. 
2A). Although there was some nonuniformity, the 
average layer thickness was close to the calculated 
nominal layer thickness. A higher magnification 
image showed the sharp boundaries between EAA 
and PEO layers (Fig. 2B). The spherulitic mor- 
phology of the bright PEO layer closely resembled 
that of PEO crystallized from the unconfined melt. 
It was expected that the properties of the PEO lay- 
ers would also be the same and, indeed, the oxygen 
permeability of films with thick PEO layers closely 
conformed to Eq. 2 with P, PEO,eff ~ Prgo. 

Another pair of images in Fig. 2, C and D, 
compares a film with 110-nm-thick PEO layers. 
The images confirmed the continuity of the thin 
PEO layers and the close correspondence be- 
tween the average layer thickness and the nom- 
inal thickness. At higher magnification, the effect 
of confinement on crystallization of the PEO 
layer shows that PEO crystallized as stacks of 
three to five long, thin lamellae oriented in the 
plane of the layer. When the PEO layer thickness 
was reduced to 20 nm, most of the PEO layers 
crystallized as single, extremely large lamellae 
whose lateral dimensions frequently exceeded 


Fig. 3. Pole figures of normals to the (120) and (032) planes of the PEO 
monoclinic crystals (27). The extrusion direction is vertical and the transverse 
direction is horizontal. The normal direction is in the center of the pole figure. 
(A) The PEO control film. (B) An EAA/PEO film with 50/50 composition, 33 
alternating layers, and nominal PEO layer thickness of 3.6 um. (C) An EAA/PEO 


the field of the AFM image (Fig. 2E). Coinci- 
dence between the layer thickness, which was 
determined by the extrusion conditions, and the 
thickness of PEO lamellae, about 20 nm, facil- 
itated crystallization of the layers as single lamel- 
lae. The single lamellae could be thought of as 
very large single crystals (25). If the layer thick- 
ness was reduced to 8 nm, the layers broke up, 
which caused the increased permeability of the 
film with 8-nm PEO layers. 

The lamellar crystalline core is considered 
impermeable, and the lamellar fold surfaces 
constitute the permeable amorphous regions. 
For a single layer of flat-on lamellae, as in the 
PEO nanolayer, the diffusion pathway depends 
on the frequency of defects such as lamellar 
edges (Fig. 2F). Structurally, the nanolayered 
assembly resembles a dispersion of impermeable 
platelets of given aspect ratio. If the platelets are 
oriented perpendicular to the flux, the permeabil- 
ity of the composite is expressed as (26) 


026? = 
a5) 


where @ is the volume fraction of impermeable 
platelets and o is the aspect ratio of the plate- 
lets defined as length divided by width. In this 
case, was taken as the volume fraction of the 
PEO crystalline phase. For the thinnest PEO 
nanolayers, the aspect ratio from Eq. 3 was as 
high as 120, which meant a lateral dimension 
of more than 2 um for lamellae 20 nm thick. 
Gradually increasing the PEO layer thickness 
relaxed the restriction on 3D growth, which 
allowed crystallization of lamellar stacks and 


P= Peal de 


thickness of 20 nm. 
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film with 70/30 composition, 1025 alternating layers, and nominal PEO layer 
thickness of 110 nm. Orientation of the (120) planes is perpendicular to the 


layer plane, and orientation of the (032) planes is at 67°. (D) An EAA/PEO film 
with 90/10 composition, 1025 alternating layers, and nominal PEO layer 
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permitted randomization of the lamellar orienta- 
tion. Also, the increased area density of nuclei in 
individual layers reduced the lateral dimension of 
lamellae. The gradual change in crystallization 
habit from single lamellae to isotropic spherulites 
paralleled a gradual increase in the permeability 
of the PEO layer. 

Confirmation of the oriented lamellar mor- 
phology and details of the global orientation were 
obtained with small-angle x-ray scattering (SAXS) 
and wide-angle x-ray scattering (WAXS). The 
SAXS examined the periodic arrangement of 
lamellar crystals within the constituent layers. By 
aligning the incident x-ray beam parallel to the 
normal direction (ND), the extrusion direction (ED), 
and the transverse direction (TD), the orientation 
of lamella was determined (fig. S3). The scatter- 
ing patterns indicated that the long spacing of the 
PEO and EAA lamellae in layered films were 
22.0 + 0.6 nm and 10.8 + 0.5 nm, respectively, 
nearly the same as in the control films. Isotropic 
scattering patterns in all three directions from 
3.6 um PEO layers indicated that the PEO layers 
were too thick for PEO lamellae to feel any 
substantial confinement effect. However, as the 
PEO layer thickness decreased to 110 nm, highly 
oriented meridional two-point scattering features 
appeared in the ED and TD patterns, which in- 
dicated that stacked PEO lamellae were oriented 
primarily parallel to the layer surface due to the 
confinement. The extremely weak first-order 
peak from 20 nm PEO layers indicated that they 
existed predominantly as single lamellae rather 
than as stacks (fig. S4). 

The orientation of the PEO chains in the 
crystal was examined with 2D WAXS and pole 
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figures. Consistent results were obtained by these 
two techniques. From the pole figures of normals 
to the (120) and (032) planes of PEO (27), it is 
seen that there is no preferred orientation of crys- 
tals in the control film except for a slight orien- 
tation due to the extrusion process (Fig. 3A). In 
Fig. 3B, the film with 3.6-um-thick PEO layers 
also showed only very weak orientation. In con- 
trast, the film with 110-nm-thick PEO layers 
showed a very strong orientation of the (120) and 
also the (032) planes (Fig. 3C). Nearly all the 
(120) planes that contained polymer chains were 
perpendicular to the film plane. This meant that 
the fold surfaces of the lamellar PEO crystals 
were in the plane of the layer. Upon decreasing 
the PEO layer thickness to 20 nm, the preferred 
orientation of PEO lamellae parallel to the layers 
seemed even stronger, as indicated by the narrower 
ring at the pole figure circumference (Fig. 3D). 
The (120) planes were distributed evenly in the 
plane of film, always being perpendicular to the 
film surface. The pole figures for (032) normal in 
Fig. 3, C and D, resembled rings exactly offset by 
67°, as predicted by the crystallographic unit cell 
for the preferred orientation of PEO lamellae 
parallel to the layer interface (/6). Again, the ring 
for the (032) normal in Fig. 3D was much nar- 
rower than in Fig. 3C. 

The crystal orientation of PEO in confined 
nanolayers essentially reproduced the crystal 
structure of PEO blocks in self-assembled PS-b- 
PEO diblock copolymers (/4, 16). Comparing 
the sharpness of the WAXS pattern, considerably 
higher orientation was achieved by physically 
confining a high-molecular-weight PEO between 


force-assembled layers than by confining a low- 
molecular-weight PEO block between self- 
assembled lamellae with covalent links. When 
the thickness confinement occurred on the size 
scale of the usual lamellar thickness, the PEO 
layers crystallized as single lamellae with ex- 
tremely large aspect ratios. It was suggested that 
the lamellae could be thought of as large, im- 
permeable single crystals. 

The coextrusion process, which operates with 
readily available polymers, now makes it possi- 
ble to fabricate nanolayered polymeric structures 
in sufficient quantities to probe the structure- 
property relationships of the morphologies result- 
ing from nanoscale confinement. For design and 
execution of packaging strategies, polymer nano- 
layers can be incorporated into conventional 
polymeric films with the right barrier properties 
for less cost, which in turn may reduce the 
environmental and energy impact. 
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Nitrogen-Doped Carbon Nanotube 
Arrays with High Electrocatalytic 
Activity for Oxygen Reduction 


Kuanping Gong,? Feng Du, Zhenhai Xia,” Michael Durstock,? Liming Dai 
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The large-scale practical application of fuel cells will be difficult to realize if the expensive 
platinum-based electrocatalysts for oxygen reduction reactions (ORRs) cannot be replaced by other 
efficient, low-cost, and stable electrodes. Here, we report that vertically aligned nitrogen-containing 
carbon nanotubes (VA-NCNTs) can act as a metal-free electrode with a much better electrocatalytic 
activity, long-term operation stability, and tolerance to crossover effect than platinum for oxygen 
reduction in alkaline fuel cells. In air-saturated 0.1 molar potassium hydroxide, we observed a 
steady-state output potential of —80 millivolts and a current density of 4.1 milliamps per square 
centimeter at —0.22 volts, compared with —85 millivolts and 1.1 milliamps per square centimeter 
at —0.20 volts for a platinum-carbon electrode. The incorporation of electron-accepting nitrogen 
atoms in the conjugated nanotube carbon plane appears to impart a relatively high positive 
charge density on adjacent carbon atoms. This effect, coupled with aligning the NCNTs, provides 

a four-electron pathway for the ORR on VA-NCNTs with a superb performance. 


cathode of fuel cells (7) plays a key role in 
controlling the performance of a fuel cell, 
and efficient ORR electrocatalysts are essential 
for practical applications of the fuel cells (2, 3). 


T= oxygen reduction reaction (ORR) at the 


The ORR can proceed either through (1) a four- 
electron process to combine oxygen with electrons 
and protons directly, when coupled with oxidation 
on the anode, to produce water as the end product, 
or (ii) a less efficient two-step, two-electron path- 


way involving the formation of hydrogen peroxide 
ions as an intermediate (2, 3). Alkaline fuel cells 
with platinum-loaded carbon as an electrocatalyst 
for the four-electron ORR were developed for the 
Apollo lunar mission in the 1960s (4), but their 
large-scale commercial application has been pre- 
cluded by the high cost of the requisite noble 
metals. Apart from its high cost, the Pt-based elec- 
trode also suffers from its susceptibility to time- 
dependent drift (5) and CO deactivation (6). 
Recent intensive research efforts in reducing 
or replacing Pt-based electrode in fuel cells have 
led to the development of new ORR electrocata- 
lysts, including Pt-based alloys (7), transition metal 
chalcogenides (8), carbon nanotube—supported 
metal particles (9-1/7), enzymatic electrocata- 
lytic systems (/2), and even conducting poly(3,4- 
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ethylenedioxythiophene) (PEDOT)-coated mem- 
branes (/3). Apart from their use as the noble 
metal—catalyst supports (9—//), aligned carbon 
nanotubes formed by high-temperature treat- 
ment of certain metal heterocyclic molecules (e.g., 
ferrocene/NH3) have been recently demonstrated 
to show some ORR electrocatalytic activities (/0). 
The observed electrocatalytic activity was attri- 
buted to the presence of FeN2-C and/or FeNy-C 
active sites in the nanotube structure (/0). How- 
ever, we found in this study that vertically aligned 


Fig. 2. (A) CVs for oxygen reduction at the 
unpurified (upper) and electrochemically 
purified (bottom) VA-NCNT/GC electrodes 
in the argon-protected (dotted curves) or 
air-saturated 0.1 M KOH (solid red curves) 
at the scan rate of 100 mVs *. Before the 
measurements, the unpurified VA-NCNT/GC 
electrode was repeatedly potentiodynamic 
swept from +0.2 V to —1.2 V in an Ar- 
protected 0.1 M KOH until a steady 
voltammogram curve was obtained. (B) 
RRDE voltammograms and the correspond- 
ing amperometric responses for oxygen re- 
duction in air-saturated 0.1 M KOH at the 
NA-CCNT/GC (curves 1 and 1’), Pt-C/GC 
(curves 2 and 2’), and NA-NCNT/GC (curves 
3 and 3’) electrodes at the scan rate of 10 
mVs~*. The electrode rotation rate was 1400 
revolutions per minute (rpm), and the Pt ring 
electrode was poised at 0.5 V. (C) RRDE 
voltammograms for oxygen reduction in air- 
saturated 0.1 M KOH at the Pt-C/GC (curve 1), 
VA-CCNT/GC (curve 2), and VA-NCNT (curve 3) 
electrodes. Because of the technical difficul- 
ties associated with the sample mounting, 
amperometic responses with the Pt ring elec- 
trode were not measured for the vertically 
aligned carbon nanotubes. (D) Calculated 
charge density distribution for the NCNTs. (E) 
Schematic representations of possible ad- 
sorption modes of an oxygen molecule at 
the CCNTs (top) and NCNTs (bottom). The C 
atoms around the pyrrolic-like nitrogen 


-1.2 


nitrogen-containing carbon nanotubes (VA-NCNTs) 
produced by pyrolysis of iron(II) phthalocyanine 
(a metal heterocyclic molecule containing nitro- 
gen) (/4), in either the presence or absence of 
additional NH; vapor (/5), could be used as 
effective ORR electrocatalysts, even after a 
complete removal of the residual Fe catalyst by 
electrochemical purification [supporting online 
material (SOM) (/6)]. These metal-free VA- 
NCNTs were shown to catalyze a four-electron 
ORR process with a much higher electrocatalytic 


Fig. 1. (A) SEM image of the as-synthesized 
VA-NCNTs on a quartz substrate. (B) TEM 
image of the electrochemically purified VA- 
NCNTs. (C) Digital photograph of the VA- 
NCNT array after having been transferred 
onto a PS-nonaligned CNT conductive nano- 
composite film (fig. Sl) (16). Scale bars, 2 um 
(A); 50 nm (B). 
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activity, lower overpotential (the difference be- 
tween thermodynamic and formal potentials), 
smaller crossover effect, and better long-term 
operation stability than that of commercially 
available or similar platinum-based electrodes 
(C2-20, 20% platinum on Vulcan XC-72R; E-TEK) 
in alkaline electrolytes (77, 18). 

A cross-sectional scanning electron micros- 
copy (SEM) view for the as-synthesized VA-NCNT 
array is shown in Fig. 1A. The observed zigzag- 
like path along the nanotube length can be at- 
tributed to the integration of nitrogen into the 
graphitic structure to alter the nanotube surface 
from a straight cylinder geometry (/0). The pres- 
ence of structural nitrogen was confirmed by x-ray 
photoelectron spectroscopic (XPS) measurements 
(fig. S2) (16), which show the incorporation of 
pyridinic-like (399 eV) and pyrrolic-like (401 eV) 
nitrogen atoms within the nanotube graphene 
sheets with a N:C atomic ratio in the range of 
~4 to 6 atomic % (table S1) (/6). The aligned 
structure remained largely unchanged after the 
electrochemical purification (/6, 19). 

The corresponding transmission electron mi- 
croscopy (TEM) image shows that, after electro- 
chemical purification, the individual NCNTs are 
free from residual iron catalyst particles (Fig. 1B) 
(16). These NCNTs are about 8 jim long and 25 nm 
in outer diameter and exhibit a bamboo-like struc- 
ture such as that seen for other CNTs (/4). Figure 
1C shows a digital photograph of the electro- 
chemically purified VA-NCNT film after having 


-0.8 -0.4 0.0 
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could possess much higher positive charges than do the C atoms around the pyridinic-like nitrogen (fig. $6) (26). 
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been transferred onto a polystyrene (PS)— 
nonaligned carbon nanotube conductive compos- 
ite film to be used to cover the active area of a 
glassy carbon electrode for ORR (fig. S1) (/6). 
The size of the free-standing VA-NCNT film 
thus prepared is limited mainly by the size of the 
furnace used for the aligned nanotube growth. 

The electrochemical purification was con- 
firmed by voltammetric responses of the VA- 
NCNT electrode before and after electrochemical 
oxidation (/6). As shown in Fig. 2A, the cyclic 
voltammogram (CV) of the glassy carbon— 
supported aligned unpurified VA-NCNT elec- 
trode (VA-NCNT/GC) (fig. S1) (76) in an aqueous 
solution of 0.1 M KOH under argon protection 
(top dot curve in Fig. 2A) exhibited two well- 
defined peaks at the potential of about —1.0 V and 
—0.65 V, arising from redox reactions associ- 
ated with the Fe catalyst residues (20). In con- 
trast, the corresponding voltammetric response 
for the same VA-NCNT/GC electrode after the 
electrochemical purification showed a featureless 
curve (bottom dot curve in Fig. 2A). This dif- 
ference strongly suggests that the iron catalyst 
residues have been largely removed by the elec- 
trochemical oxidation, as also evidenced by XPS 
(fig. S2) (16) and thermogravimetric analyses (fig. 
S3) (16). 

Also shown in Fig. 2A are CVs for oxygen 
reduction in 0.1 M KOH solution at the VA- 
NCNT/GC before (top solid red curve) and after 
(bottom solid red curve) the electrochemical pu- 
rification. Unlike conventional carbon electrodes 
(21), a cathodic process with a rather high re- 
duction potential (i.e., lower overpotential) of 
about —0.15 V was seen for the ORR at both 
the unpurified and electrochemically purified 
VA-NCNT/GC electrodes with some notable 
difference in the peak shape. To gain further 
insight on the ORR electrochemical proce- 
dures, we performed rotating ring-disk electrode 
(RRDE) voltammograms. Figure 2B shows the 
steady-state voltammograms for nitrogen-free 
nonaligned carbon nanotubes supported by a 
glassy carbon electrode (NA-CCNT/GC, curve 1), 
commercially available platinum-loaded carbon 
(Vulcan XC-72R) supported by a glassy carbon 
electrode (Pt-C/GC, curve 2), and glassy carbon— 
supported nonaligned nitrogen-containing carbon 
nanotubes (NA-NCNT/GC, curve 3) in air- 
saturated 0.1 M KOH electrolyte. The corre- 
sponding amperometric responses (curves 1’ to 
3’) for the oxidation of hydrogen peroxide ions 
(HO, ) measured with a Pt ring electrode at the 
potential of 0.50 V (6) are also included in Fig. 
2B. The NA-CCNT/GC electrode showed a two- 
step process for ORR with the onset potential of 
about —0.22 V and —0.70 V, respectively (curve 1 
in Fig. 2B), consistent with previous reports 
(8, 21). We attribute the first sharp step over 
—0.22 V to the two-electron reduction of O> to 
HO, , as evidenced by a substantial concomi- 
tant increase in the oxidation current over about 
—0.22 to —0.41V at the ring electrode (curve 1’ 
in Fig. 2B). The subsequent gradual decrease in 


the ring current seen in curve 1’ of Fig. 2B cor- 
responds to the reduced amount of HO, reached 
to the ring electrode before being oxidized into 
HO at the disk electrode under the increased 
negative potentials. 

Unlike the NA-CCNT/GC electrode, the NA- 
NCNT/GC electrode exhibited a one-step process 
for the ORR with the steady-state diffusion cur- 
rent that was almost twice that obtained at the 
NA-CCNT/GC electrode (curves | and 3 in Fig. 
2B). Like the Pt-C/GC electrode, the observed one- 
step process suggests a four-electron pathway 
for the ORR at the NA-NCNT/GC electrode, as 
also supported by the corresponding negligible 
current for HO, oxidation recorded at the Pt 
ring electrode (curve 3’ in Fig. 2B). The transferred 
electron number (7) per oxygen molecule involved 
in the ORR was calculated from Eq. 1 (22) to be 
1.8 and 3.9 for the NA-CCNT/GC electrode (at 
the potential of —0.40 V) and the NA-NCNT/GC 
electrode (at the potential of -0.30 V), respectively. 


n= 4Ip/(Up + Ir/N) (1) 


where N = 0.3 is the collection efficiency (/6), 
Ip is the faradic disk current, and Jp is the 
faradic ring current. 

Figure 2C shows the steady-state voltam- 
mograms for Pt-C/GC (curve 1), VA-CCNT/GC 
(curve 2), and VA-NCNT/GC (curve 3) elec- 
trodes in the air-saturated 0.1 M KOH electrolyte. 

The half-wave potentials for ORR at the NA- 
NCNT/GC (curve 3, Fig. 2B) and VA-NCNT/GC 
electrodes (curve 3 in Fig. 2C) are comparable 
to that at the Pt-C/GC electrode (-0.1 V), but a 
substantially enhanced steady-state diffusion cur- 
rent (~0.8 mA) was observed over a large poten- 
tial range for the VA-NCNT/GC electrodes with 
respect to the Pt-C/GC electrode (Fig. 2C). Thus, 
the VA-NCNT/GC electrode is much better than 
the Pt-C/GC electrode for the ORR in an alka- 
line solution. Compared with the NA-NCNT/GC 
electrode, the better electrocatalytic performance 
of the VA-NCNT/GC electrode can be attributed 
to its well-defined large surface area with all of 
the nanotube top-ends falling on one plane at the 
interface between the aligned nanotube electrode 
and electrolyte solution to further facilitate the 
electrolyte/reactant diffusion (23, 24). Similar cur- 
rent enhancement by the alignment structure was 
also observed for the VA-CCNT/GC electrode 
(curve 2 in Fig. 2C) with respect to its nonaligned 
counterpart (curve | in Fig. 2B), albeit with a 
relatively small effect and at a very high over- 
potential. The similar shape of the steady-state 
voltammograms for the aligned nanotube elec- 
trodes and their respective nonaligned counterparts 
indicates that the electrochemical mechanism for 
the ORR is insensitive to the alignment structure, 
although alignment can improve the electrokinet- 
ics. However, the much stronger diffusion-limited 
currents and lower overpotentials observed for the 
NCNT electrodes than their nitrogen-free counter- 
parts indicate the importance of the nitrogen het- 


“1.2 
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Fig. 3. CVs for the ORR at the Pt-C/GC (top) and 
VA-NCNT/GC (bottom) electrodes before (solid black 
curves) and after (dotted curves) a continuous po- 
tentiodynamic swept for ~100,000 cycles in an air- 
saturated 0.1 M KOH at room temperature (25 + 
1°C). Scan rate, 100 mV s~*. The wavelike bands 
over —1.0 to —0.5 V seen for the pristine Pt-C/GC 
electrode are attributable to hydrogen adsorption/ 
desorption. 


eroatoms in the NCNTs (fig. S2) (6) to their 
electrocatalytic activities for the ORR. 

Quantum mechanics calculations with B3LYP 
hybrid density functional theory (Gaussian 03) 
(25) indicate that carbon atoms adjacent to ni- 
trogen dopants possess a substantially high posi- 
tive charge density to counterbalance the strong 
electronic affinity of the nitrogen atom (Fig. 2D 
and figs. SS and S6) (/6). Together with the re- 
cent work on the metal-free conjugated PEDOT 
ORR electrode (/3), this result prompted us to 
put forward an O, reduction mechanism on the 
NCNT electrodes. A redox cycling process re- 
duces the carbon atoms that naturally exist in an 
oxidized form by the action of the electrochem- 
ical cycling, followed by reoxidation of the re- 
duced carbon atoms to their preferred oxidized 
state upon O absorption. The nitrogen-induced 
charge delocalization could also change the chem- 
isorption mode of O, from the usual end-on ad- 
sorption (Pauling model) at the CCNT surface 
(top, Fig. 2E) to a side-on adsorption (Yeager 
model) onto the NCNT electrodes (bottom, Fig. 
2E) (26). The parallel diatomic adsorption could 
effectively weaken the O—O bonding to facilitate 
ORR at the NCNT/GC electrodes. As such, dop- 
ing carbon nanotubes with nitrogen heteroatoms 
as in the NCNT electrodes can efficiently create 
the metal-free active sites for electrochemical 
reduction of O>. 

To investigate the stability of the VA-NCNT/GC 
electrode toward ORR, we performed continu- 
ous potential cycling between +0.2 and —1.2 V 
for the VA-NCNT/GC electrode with Pt-C/GC 
as reference in air-saturated 0.1 M KOH for 
~100,000 cycles. As can be seen in Fig. 3, the 
deterioration of Pt occurred, apart from the ORR, 
on the commercial Pt-C/GC (Vulcan XC-72) 
electrode (top curves) (27). The continuous po- 
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Fig. 4. (A) i-t chronoamperometric responses obtained at the VA-NCNT/GC electrode at —0.25 V in 0.1 M 
KOH under magnetic stirring (1000 rpm, MR 3001 K, Heidolph) and Ar-protection over 0 to 2400 s, 
followed by an immediate introduction of air. The arrow indicates the sequential addition of 3.0 M 
glucose, 3.0 M methanol, and 3.0 M formaldehyde, respectively, into the air-saturated electrochemical cell 
(fig. S1B) (26). (B) Corresponding i-t chronoamperometric response obtained at the Pt-C/GC electrode with 
the addition of 3.0 M methanol (as indicated by the arrow) after exposure to air at 1000 s under the same 
conditions as in (A) for comparison. (C) CO-poison effect on the i-t chronoamperometric response for the 
Pt-C/GC and VA-NCNT/GC electrodes. The arrow indicates the addition of 55 mL/min CO gas into the 550 
mL/min QO, flow; the mixture gas of ~9% CO (volume/volume) was then introduced into the 


electrochemical cell (fig. S1B) (16). ip, initial current. 


tential cycling might have caused migration/ 
aggregation of the Pt nanoparticles and subsequent 
loss of the specific catalytic activity. In contrast, 
the VA-NCNT/GC electrode showed almost 
identical voltammetric responses before and after 
the continuous potential cycling under the same 
condition (bottom curves in Fig. 3). 

The VA-NCNT/GC electrode was further 
subjected to testing the possible crossover and 
poison effects in the presence of fuel molecules 
(e.g., methanol) and CO, respectively (27-29). 
To examine the possible crossover effect, we 
measured electrocatalytic selectivity of the VA- 
NCNT/GC electrode against the electrooxidation 
of various commonly used fuel molecules, in- 
cluding hydrogen gas, glucose, methanol, and 
formaldehyde (Fig. 4A). For comparison, the cor- 
responding current-time (i-t) chronoamperometric 
response for a Pt-C/GC electrode given in Fig. 4B 
shows a sharp decrease in current upon the ad- 
dition of 3.0 M methanol. However, the strong 
and stable amperometric response from the ORR 
on the VA-NCNT/GC electrode remained un- 
changed after the sequential addition of hydro- 
gen gas, glucose, methanol, and formaldehyde 
(Fig. 4A). Such high selectivity of the VA- 
NCNT/GC electrode toward the ORR and re- 
markably good tolerance to crossover effect for 


the VA-NCNT/GC electrode can be attributed to 
the much lower ORR potential than that required 
for oxidation of the fuel molecules (30) and is 
further evidenced by the cathodic polarization 
measurements in the presence and absence of 
3.0 M methanol (fig. S4) (76). 

The effect of CO on the electrocatalytic ac- 
tivity of the VA-NCNT/GC electrode was also 
tested because CO poisoning of most noble-metal 
electrodes is a major issue in the current fuel cell 
technology. Being nonmetallic, the VA-NCNT/GC 
electrode was insensitive to CO poisoning even 
after adding about 10% CO in oxygen (red line 
in Fig. 4C), whereas the Pt-C/GC electrode was 
rapidly poisoned under the same conditions (black 
curve in Fig. 4C). This result indicates that the 
VA-NCNT/GC electrode is free from residual iron 
particles, as iron-based catalysts would otherwise 
be readily poisoned by CO (6). 

The high-surface area, good electrical and me- 
chanical properties, and superb thermal stabil- 
ity intrinsically characteristic of aligned carbon 
nanotubes (23, 3/, 32) provide additional ad- 
vantages for the nanotube electrode to be used 
in fuel cells under both ambient and harsh con- 
ditions. Although cost depends on many factors, 
scarcity is not an intrinsic issue for carbon nano- 
tubes. As the number of industrial-scale facilities 


REPORTS 


for the relatively low-cost production of carbon 
nanotubes continues to grow, the price of carbon 
nanotubes is expected to further decrease (33). 
The role of nitrogen-doping demonstrated in this 
study could be applied to the design and de- 
velopment of various other metal-free efficient 
ORR catalysts, and these nitrogen-containing car- 
bon nanotube electrodes are clearly of practical 
importance. 
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Anomalously Metal-Rich Fluids Form 
Hydrothermal Ore Deposits 
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Hydrothermal ore deposits form when metals, often as sulfides, precipitate in abundance from 
aqueous solutions in Earth’s crust. Much of our knowledge of the fluids involved comes from 
studies of fluid inclusions trapped in silicates or carbonates that are believed to represent aliquots 
of the same solutions that precipitated the ores. We used laser ablation inductively coupled plasma 
mass spectrometry to test this paradigm by analysis of fluid inclusions in sphalerite from two 
contrasting zinc-lead ore systems. Metal contents in these inclusions are up to two orders of 
magnitude greater than those in quartz-hosted inclusions and are much higher than previously 
thought, suggesting that ore formation is linked to influx of anomalously metal-rich fluids into 
systems dominated by barren fluids for much of their life. 


ydrothermal ore deposits, formed from 
He flow of hot solutions through porous 

or fractured rocks, are the principal 
source of metals in Earth’s crust (7). Such large 
accumulations of metal require concentration of 
elements hundreds or thousands of times above 
natural abundance, implying high-mass fluxes 
through small volumes of rock coupled with ef- 
ficient precipitation. A fundamental control on 
the formation of hydrothermal deposits is the 
ability of the fluid to carry metals in solution (2). 
Yet, paradoxically, for most deposit types formed 
at low-to-intermediate temperatures, both direct 
analysis of fluid inclusions and theoretical cal- 
culation indicate that the concentrations of dis- 
solved metals are likely to be low, on the order of 
tens of parts per million (3). Also, samples of 
modem crustal fluids, such as those from oil 
fields or mid-ocean ridges, typically contain only 
a few parts per million of Cu, Zn, and Pb (4, 5), 
although there are exceptions, such as the Salton 
Sea geothermal brines in California (6) and oil- 
field waters from central Mississippi (4). A con- 
sequence is that the other parameters that govern 
total metal flux in ore formation (average flow 
velocity and system lifetime) tend toward their 
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likely geological limits in both numerical simu- 
lations and empirical models based on geological 
and geochronological constraints (7, 8). As a 
result, it has been suggested that higher-than- 
normal concentrations of metal in fluids may be 
required to form large ore bodies (9). 

For several decades, a key source of infor- 
mation on the physical and chemical conditions 
of hydrothermal ore formation has been fluid 
inclusions trapped during mineral growth (/0). In 
most deposits, metalliferous ore minerals (com- 
monly opaque sulfides) occur together with un- 
economic transparent phases (gangue). Because 
fluid inclusions in the opaque phases are not 
easily studied by traditional transmitted light mi- 
croscopy and microanalytical methods, the na- 
ture of ore-forming fluids and the conditions of 
ore-mineral precipitation have often been in- 
ferred from the properties of inclusions trapped in 
the associated gangue minerals. However, it is 
often difficult to provide unequivocal evidence 
for coprecipitation based on textural observations 
or isotopic measurements (//); consequently, un- 
certainty remains concerning the temporal and, 
therefore, genetic relationship between gangue- 
hosted inclusions and the ore-forming process. 
Several studies that used infrared light microsco- 
py to observe inclusions in opaque minerals such 
as wolframite and cassiterite have shown that the 
properties of these fluid inclusions may, indeed, 
be different (/2). 

We analyzed fluid inclusions in sphalerite 
(ZnS) from two zinc-lead ore systems with the 
use of laser ablation inductively coupled plasma 
mass spectrometry (LA-ICPMS). Primary inclu- 
sions in sphalerite must represent the ore-forming 
fluid because they are trapped during growth of 
the ore mineral itself. Unlike bulk analytical 
studies that are limited to a few major elements 


and that may sample multiple populations of in- 
clusions (/3), LA-ICPMS allows determination 
of trace elements (including ore metals) in single, 
texturally constrained inclusions. 

We selected samples from two well-studied 
hydrothermal ore systems. The Northern Arkan- 
sas district of the Ozark Plateau, North America, 
is an example of low-temperature Mississippi 
Valley-Type (MVT) zinc-lead mineralization, 
thought to have formed by continent-scale ba- 
sinal brine migration (/4). The Midlands Basin 
orefield in Ireland contains several large zinc- 
lead(-barium) ore deposits formed from moderate 
temperature fluids generated by deep crustal 
circulation of seawater-derived brines during 
continental rifting (75, /6). Both systems are 
economically noteworthy but provide a contrast 
in terms of sources of metals, sulfur, and hydro- 
logical regime. Lead is of particular interest be- 
cause it needs to be concentrated above average 
crustal abundance more than any other common 
ore-forming element (~4000 times) to form a 
potentially economic accumulation. 

Samples from Northern Arkansas were col- 
lected from exposures in the Monte Cristo and 
Philadelphia Mines of the Rush subdistrict and 
from the Lucky Dog Mine of the Tomahawk 
Creek subdistrict. They comprise fine- to coarse- 
grained crystalline quartz and medium- to 
coarse-grained pale yellow—to—brown sphalerite. 
Regionally, precipitation of sphalerite typically 
overlapped with that of jasperoid and finely 
crystalline quartz, and more coarsely crystalline 
quartz formed later (7). Samples from Ireland 
were collected from historic mine exposures and 
drill core from the Silvermines deposit, as well as 
from quarry outcrop of quartz-sulfide veins near- 
by. The deposit samples are composed of mas- 
sive sulfide dominated by coarse-grained brown 
sphalerite that replaces early disseminated gran- 
ular and framboidal pyrite. The vein sample is 
composed of quartz and ankerite, as well as 
minor sphalerite and galena, and was selected as 
a representative example of a regionally devel- 
oped set of feeder veins developed underneath 
the ore deposits (78, 19). 

Salinity data derived from freezing experi- 
ments (20) show that the Northern Arkansas 
mineralization formed from brines, typical of 
MVT deposits (Fig. 1). Assuming fluids were 
trapped at hydrostatic pressure at depths of <2 
km, the inferred depth of ore formation (/4), we 
calculated an isochoric correction of <+10°C to 
recorded homogenization temperature (7j,) val- 
ues to give true trapping temperatures. Thus, 7, 
can be regarded as a reasonable approximation of 
fluid temperature during mineral growth. Inclu- 
sions from the Irish samples display lower salin- 
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ity and higher 7, values than the MVT fluids 
(Fig. 1), typical for the Irish orefield (//). For 
the Irish ores, which formed at shallow depth 
(16, 19), any correction to homogenization tem- 
peratures will again be small so that 7;, values can 
be regarded as a good proxy for fluid trapping 
temperature. 

Laser ablation analyses were carried out with 
the use of a New Wave UP213AI, 213-nm 
aperture-imaged laser ablation system (20-22) 
on primary fluid inclusions interpreted to have 


formed during initial mineral growth based on con- 
ventional petrographic criteria (10). Some second- 
ary inclusions, formed during later fracturing and 
annealing, were analyzed for comparison (Fig. 2). 
Lead and other elements of interest in hydrothermal 
systems such as Ba and Mn are clearly present in 
the fluid phase, as indicated by their good corre- 
lation with Cl in inclusion signals (Fig. 3). Full 
data are reported in tables S1 and S2 (20). 

Zn concentrations in primary quartz-hosted 
inclusions from Northern Arkansas are low, rang- 
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Fig. 1. (A) Quartz wafer from Northern Arkansas showing primary growth zones (GZ). (B) 
Magnification of inset shown in (A) illustrating complex distribution of fluid inclusions, together 
with some secondary trails (S) and selected primary inclusions (P) within growth clusters. (C) 
Sphalerite wafer from Ireland showing growth zones defined by fine fluid inclusions and color 
banding, together with euhedral primary inclusions. (D) Plot of fluid temperature and salinity data 
derived from microthermometry. Salinity was estimated from the freezing point depression of ice, 
modeled in the NaCl-H20 system (20). For Northern Arkansas [data previously shown in (27)], 
quartz from Monte Cristo and sphalerite (Sp) from Monte Cristo and Philadelphia contain 
apparently identical primary inclusions. Slightly lower salinity primary inclusions are found in 
sphalerite from the Lucky Dog mine (20 km to the southwest), indicating geographic variability in 
brine composition. In the Irish Orefield, UG-1 sphalerite contains primary and secondary inclusions 
of a less saline brine [12 to 15 weight percent (wt %) NaCl equivalent] and a trail of inclusions of 
more saline fluids (16 to 18 wt % NaCl equivalent). This cuts a growth zone boundary (Fig. 2), 
indicating that these fluids are younger than this surface and its associated primary inclusions, but 
the exact timing with respect to other inclusions in the sample is uncertain (U). Fluids in the more 
saline population are trapped as primary inclusions in sphalerite 75-85-104. The data display a 
bimodal salinity distribution that mirrors the distribution observed in regional-scale fluid inclusion 
studies (see histogram at right), suggesting that these modes reflect multiple pulses of districtwide 
flow affecting a rock volume estimated at >130,000 km?. Analogous brine pulses have been 
inferred in the recent history of the Salton Trough geothermal field (6). The evidence noted above, 
together with crosscutting relations observed in other samples, suggests that the higher salinity 
fluid pulse is later and is associated with the majority of the sphalerite in the district. The inclusions 
analyzed by LA-ICPMS in sphalerite sample 75-85-104 are not plotted because homogenization 
experiments could not be carried out due to problems with leakage. 
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ing from 0.12 to 12.3 parts per million (ppm). 
In Irish quartz, except for one primary inclusion 
(14 ppm), Zn was below the limit of detection 
(mean = 37.6 ppm) because of the small inclu- 
sion size. Such low Zn concentrations are con- 
sistent with previously reported bulk analyses of 
3.4 to 6.0 ppm (/6). Unfortunately, we were not 
able to measure the Zn concentration of inclu- 
sions trapped in sphalerite because of the over- 
whelming host mineral contribution to the laser 
ablation signal. A similar problem also occurred 
for Cu (Fig. 3). However, Pb can be used as an 
indicator of the ore metal content of these 
inclusions, as it does not substitute appreciably 
into sphalerite, and its presence there can be cor- 
rected for (20). 

In Northern Arkansas, Pb concentrations dis- 
play a marked bimodal distribution, ranging from 
0.2 to 3.5 ppm in quartz and primarily from 10 to 
400 ppm in sphalerite (Fig. 4). This excludes five 
sphalerite-hosted inclusions that fall in the lower 
population, interpreted to represent unrecognized 
secondary inclusions trapping fluid related to the 
later quartz. The quartz-hosted inclusion data are 
consistent with a 266-nm LA-ICPMS study that 
found that fluid inclusions in gangue minerals 
from the Southeast Missouri MVT district con- 
tained Pb, Zn, and Cu concentrations below in- 
strumental detection limits of ~10 ppm (23). The 
Monte Cristo and Philadelphia sphalerites have 
similar mean Pb concentrations (80 ppm) that are 
lower than the Lucky Dog sphalerites (119 ppm). 

In Ireland, we also observed a distinction 
between the Pb content of primary inclusions in 
quartz (3.6 to 26 ppm) and sphalerite (22 to 890 
ppm) (Fig. 4). The quartz-hosted inclusion data 
are consistent with bulk fluid inclusion analyses 
for Irish feeder veins that gave Pb concentrations 
of 11.4 to 19.8 ppm (/6). The more saline pri- 
mary inclusions in sphalerite 75-85-104 have a 
higher mean Pb concentration (430 ppm) than the 
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Fig. 2. Transmitted light digital photomontage of 
double-polished fluid inclusion wafer (~100 um 
thick) of Irish sphalerite UG-1. Shown are individ- 
ual fluid inclusions analyzed (numbered) and 
average determined Pb concentrations for each 
population of primary, secondary, or uncertain 
inclusions (same as those referred to in Fig. 1). 
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Fig. 3. (A). Example of a time-resolved laser A 
ablation spectrum for a fluid inclusion in 
sphalerite (primary inclusion 9, Irish sphal- 
erite 75-85-104). Initially, gas background 
was acquired and then the laser was turned 
onat 24s. Signals for °°Zn and °°Cu increase 
as sphalerite begins to be ablated. The 
inclusion was breached at ~52 s. The signal 
was integrated offline over an ~16-s interval. 
The y axis is scaled from maximum to mini- 
mum recorded counts per second (cps) for 
each individual isotope. (B) The good corre- 
lations between the intensity for >°Cl (only 
present in the fluid inclusion) and all iso- 
topes measured (except ®Cu) through the 
integration interval confirms their predomi- 
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nance in the fluid phase. This analysis returned 160 ppm Li, 530 ppm Mg, 7740 ppm K, 17500 ppm Ca, 530 ppm Mn, 2750 ppm Sr, 1770 ppm Ba, and 61 ppm Pb. 
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Fig. 4. Concentrations of Zn versus Pb in quartz-hosted fluid inclusions and histograms showing 
measured Pb concentrations in quartz- and sphalerite-hosted inclusions, compared with natural 
brine data from broadly analogous environments. Predicted Pb-Zn covariation for sphalerite-hosted 
inclusions indicated by fields, based on the extrapolation of empirical Pb-Zn concentrations, 
including data from modern oil-field waters (24, 25) adjusted to mass/mass units and from the 
Salton Sea geothermal field (6). Both of these data sets have values that fall in our “high-metal” 
fluid fields, suggesting that they are anomalously metalliferous and viable ore-forming fluids. 
Typical Zn:Pb ratios (by mass) for natural fluids are in the range of 1 to 10 (dashed lines) and tend 
toward higher values at elevated concentrations. Five data points from the Northern Arkansas data 
set with low detected Pb are inferred secondary inclusions. Short bars on symbols indicate that the 
plotted value is a maximum (limit of detection) value for that element. Qz, quartz; Sp, sphalerite; 
MC, Monte Cristo; PA, Philadelphia; LD, Lucky Dog. Some of the Northern Arkansas data shown are 


presented in (27), figures 6 and 7. 


inclusions in sphalerite UG-1 (120 ppm); sec- 
ondary inclusions in both samples display similar, 
lower concentrations. Two secondary inclusions 
in quartz contained higher metal concentrations 
(27 to 128 ppm) (Fig. 4) than the primary in- 
clusions, indicating overprinting by later, more 
metalliferous fluids such as those trapped in 
sphalerite. 

Pb and Zn concentrations are commonly 
correlated in hydrothermal solutions as a result 
of their similar geochemical behavior and 


potential buffering by their respective sulfides 
(4, 24). This is illustrated by their empirical 
covariation in modern basin brines (4, 6, 24-26) 
and in the quartz-hosted inclusions in this study 
(above ~1 ppm) where both metals were de- 
termined (Fig. 4). This relation allows us to 
model the likely range of Zn concentrations to be 
expected in the sphalerite-hosted inclusions at the 
time of trapping: up to 5000 ppm in Irish sphal- 
erite and 3000 ppm in Northern Arkansas. The 
latter estimate is ~60% of the value predicted 


theoretically from thermodynamic data using 
measured Pb concentrations, assuming galena 
and sphalerite saturation and making reasonable 
assumptions about pH and oxygen fugacity (/7). 
Although these are only order-of-magnitude 
estimates, it is clear that the fluids precipitating 
sphalerite were markedly enriched in Pb and Zn 
compared with those precipitating quartz in the 
two systems. 

It could be argued that the fluid inclusions in 
quartz represent spent ore fluids, trapped after 
sulfide precipitation had already taken place. This 
is thought to be unlikely in the Irish deposits 
because the quartz was sampled from largely 
barren vein systems that formed beneath the ore 
deposits, in hydrothermal upflow zones (/6), and 
the fluids show no signs of having substantially 
cooled or mixed (Fig. 1), as is known to occur 
during mineralization (/6, 19). In the Tri-State 
MVT district, we found sphalerite- and quartz- 
hosted brine inclusions to have distinct halogen 
signatures, indicating that the fluids had separate 
origins (/7). The fertile ore fluids appear to have 
originated during strong evaporation of seawater 
at Earth’s surface, before later burial and expul- 
sion. This observation contrasts with the barren 
fluids that evolved from less strongly evaporated 
seawater (/7). The metalliferous fluids are there- 
fore linked to the paleoclimate and the specific 
characteristics of the sedimentary aquifer within 
which they were trapped. The development of 
low-pH surface brines in the U.S. mid-continent 
in the Permian via sulfide oxidation (27) is an 
intriguing possible origin for such metal-prone 
fluids. Not only will unusually low pH enhance 
metal solubilization, but the reservoir rocks will 
be depleted in reduced sulfur that would other- 
wise limit metal take-up. The nonideal behavior 
of Cl at high ionic strengths, coupled with low 
pH and chloride complexing, has been cited as a 
control of high Zn and Pb concentrations in the 
most saline modern oil-field waters (6), but this 
does not account for the low metal concentrations 
observed in similar salinity brines in some oil 
fields or those trapped in gangue minerals in this 
study (Fig. 4). Irrespective of the origin, both 
numerical models (8) and empirical observation 
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of brine heterogeneity in modern sedimentary 
basins (6) imply inefficiency of mixing and the 
potential for preservation of individual, metal- 
charged brine reservoirs that could be tapped at 
some later time. 

In the case of Ireland, the origin of metal- 
enriched fluids is uncertain, although a deepening 
convective flow system (/5) has the capability to 
extract higher concentrations of metal later in the 
life of the system due to increasing temperature 
and possibly also progressive exhaustion of the 
buffer capacity for pH (by feldspar-mica) or 
the activity of H2S (by pyrite-Fe silicate) on the 
convective flow path. The observation that the 
texturally later brines have higher metal contents 
is consistent with this model, although their 
higher salinity is also likely to have contributed 
to enhanced metal transport. The high Ba content 
of the metalliferous fluids (up to 6000 ppm) 
indicates that the oxidized sulfur content must 
have been low, limited by barite saturation. 
Combined with high base metal concentrations 
that imply low reduced sulfur concentrations, we 
conclude that a key property of these fluids was 
low XS (total sulfur concentration). 

High metal concentrations may pertain in 
other types of hydrothermal ore systems, such as 
epithermal or volcanic-hosted massive sulfide 
deposits. In these environments, periodic injec- 
tion of metalliferous magmatic fluids may be 
responsible for the bulk of metal introduction 
(28-30) into systems otherwise dominated by 
barren geothermal waters. A number of large, 
high-temperature, magmatic-hydrothermal de- 
posits are also known to have formed from 
magmatic fluids that contained very high con- 
centrations of ore metals (3/—33). Accepting that 
hydrothermal ores may form specifically from 
anomalously metal-rich batches of fluid implies 


geochemically specialized source regions and an 
episodicity and potentially short duration of ore- 
forming events that may be controlled by changes 
in hydrology. Although the existence of an ef- 
ficient trap for these metals remains a fundamental 
prerequisite for hydrothermal ore formation, our 
interpretation contrasts with the view that many 
crustal fluids are viable ore fluids subject to the 
right perturbations in physicochemical conditions 
to cause efficient deposition (24). 
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Signature of the End-Cretaceous Mass 
Extinction in the Modern Biota 


Andrew Z. Krug,? David Jablonski,’ James W. Valentine” 


The long-term effects of mass extinctions on spatial and evolutionary dynamics have been 
poorly studied. Here we show that the evolutionary consequences of the end-Cretaceous 
[Cretaceous/Paleogene (K/Pg)] mass extinction persist in present-day biogeography. The geologic 
ages of genera of living marine bivalves show a significant break from a smooth exponential 
distribution, corresponding to the K/Pg boundary. The break reflects a permanent increase in 
origination rates, intermediate between the Mesozoic rate and the post-extinction recovery pulse. 
This global rate shift is most clearly seen today in tropical bioprovinces and weakens toward the 
poles. Coupled with the modern geographic distributions of taxa originating before and after the 
K/Pg boundary, this spatial pattern indicates that tropical origination rates after the K/Pg event 
have left a permanent mark on the taxonomic and biogeographic structure of the modern biota, 
despite the complex Cenozoic history of marine environments. 


each of which removed >50% of well- 
preserved genera and perhaps >70% of 
their species (/, 2), irreversibly restructured the 
taxonomic composition of the global biota. 


T= major extinctions of the geologic past, 


Although the broad macroevolutionary conse- 
quences of mass extinctions are well known (as 
in the dinosaurs-mammals changeover), their 
long-term effects on the temporal and spatial 
dynamics of clades and biotas are rarely inves- 


tigated. For example, the good fit of modern bio- 
diversity to local temperatures and to refugia 
from recent glaciations (3, 4) might imply that the 
recovery from the most recent major extinction, 
at the end of the Cretaceous Period 65 million 
years ago (Ma) [the Cretaceous/Paleogene (K/Pg) 
extinction], is largely obscured by subsequent 
events. 

Here, we show the lasting influence of the 
K/Pg mass extinction on the evolutionary and 
biogeographic structure of modern biotas, using 
backward survivorship curves (BSCs, also called 
prenascence curves), which plot the proportion of 
taxa within a cohort that originated before some 
reference time (5, 6). Assuming that rates are 
time-specific and taxonomically homogeneous 
(5), a BSC defines an exponential probability 
function governed only by the origination rate (A) 
of the cohort (7). The slope of a BSC is insen- 
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sitive to variations in extinction rate, so any sig- 
nificant changes in that slope signal major shifts 
in the origination rate (5). BSCs are particularly 
useful here because they let us (i) estimate in- 
trinsic rates within bivalve cohorts, as opposed to 
a building a time series of individual events; and 
(ii) analyze the structure of age-frequency dis- 
tributions in modern biotic provinces in compar- 
ison to the global distributions. 

We analyzed (7) a taxonomically standard- 
ized, spatially explicit, global database of shallow- 
water living marine bivalves (8, 9), which now 
includes 73 families (table S1), 854 genera and 
subgenera (711 with a fossil record), 5132 spe- 
cies, and 28,264 spatial occurrences from 322 
localities (fig. S1). Marine bivalves are one of the 
few groups that allow a direct analysis of long- 
term evolutionary and biogeographic dynamics: 
They have a rich and spatially extensive fossil 
record, their systematics are increasingly well 
known and standardized, and imperfections in 
their record are increasingly understood in terms 
of preservational controls (/0—/5). Their diversi- 
ty patterns today and through the Cenozoic cor- 
respond closely to those of other major marine 
clades, both spatially and temporally (/6, 17), 
and so they are a robust model system for prob- 
ing the evolutionary and biogeographic history of 
the global biota. Localities were grouped into 
provinces according to sources cited in (7). BSCs 
were constructed (5, /8, 19) using the geologic ages 
of genera in an extensively revised and updated 
version of Sepkoski’s compendium (8, 10, 20). 
We tested the fit of one-parameter, three-parameter, 
and five-parameter probability functions (7) to 
the global and provincial data, using the optim 
function in the software package R. 

The BSC for extant bivalves globally shows a 
distinct inflection point at ~65 Ma (Fig. 1, A and 
B), with an increase in origination rate (A) after 
the mass extinction (from 0.015 to 0.029 genera 
per lineage per million years) that continues to 
the present. A three-parameter model, incorpo- 
rating two probability functions (with rate param- 
eters A, and A,; see methods in supporting online 
material) describing the BSC on either side of the 
inflection point (fi), 18 statistically supported 
over a single exponential fit (table S1). The 
optimizations place the inflection point near the 
K/Pg boundary, maximally supporting an age of 
69 million years (My), not distinct from the K/Pg 
boundary, given the resolution of our binning 
scheme (7); the difference between the support of 
a critical age of 65 versus 69 My is not statis- 
tically meaningful (fig S2). This slight offset may 
also reflect less intense study in the 75- to 100- 
My interval, which could artificially concentrate 
originations in the last 10 My of the Cretaceous 
(2). This result indicates a significant and perma- 
nent increase in the origination rate of extant 
marine bivalve clades after the K/Pg extinction. 

A discrete rebound phase with a steeper slope 
is also visible in semi-log space for the global 
data set (Fig. 1B), verified by a five-parameter 
(three rate parameters and two inflection points) 


model with a slope of 0.065 (four times the 
Mesozoic value and twice the overall Cenozoic 
value) and bounded at 65 and ~50 Ma. This 
short-term origination pulse is consistent with 
patterns described for post-K/Pg bivalves (2/) 
and for other post-extinction intervals, and pat- 
terns of diversification in recovery episodes clearly 
stand as distinct from those in more quiescent 
times. However, this transient increase in origi- 
nation does not bias the estimated rate differential 
between the broader Mesozoic and Cenozoic 
intervals. The rate estimated for the final 50 My 
of the Cenozoic (0.026) is very similar to that for 
the entire Cenozoic (0.030), indicating that the 
overall Cenozoic rate, A, estimated with the 
three-parameter model above (table S2), is math- 
ematically dominated by the shift in “normal” 
origination rates rather than the post-extinction 
rebound interval. 

The evolutionary signature of the K/Pg event 
is also seen at the provincial scale (Figs. 1C and 


Fig. 1. Global and regional BSCs for living 
marine bivalves; the vertical red line marks 
the K/Pg boundary (65 Ma). (A) Global BSC 
plotted on arithmetic axes. (B) Global BSC 
in semi-log space showing constant rates 
and different slopes of the pre- and post-K/Pg 
portions of the curve. Straight lines in semi- 
log space represent constant exponential 
rates. (C) Decrease in BSC slopes with in- 
creasing latitude for the Northern Hemi- 
sphere bioprovinces of the eastern Pacific. 
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2), with a strong latitudinal component despite 
the apparent global homogeneity of the extinc- 
tion itself (22). The three-parameter function, 
with an inflection point near the K/Pg boundary, 
is the optimal fit for 19 of the 27 provinces; the 
provincial curves show some additional varia- 
tion, but minor inflection points are not supported 
when the five-parameter model is used (perhaps 
due to small sample sizes in provinces with addi- 
tional non-K/Pg inflections, such as the Lusitanian; 
Fig. 2). Provincial values of 4 decrease with lat- 
itude (Fig. 1C), so that a one-parameter expo- 
nential function is best supported for two polar 
and six high-latitude provinces (table $2). This 
latitudinal effect is seen most clearly when a 
critical time of 65 My is imposed on all 27 
provinces and the probability functions are fit to 
the data: The difference between the post- and 
pre-K/Pg rates (A, — A) declines significantly as 
the latitudinal midpoint of the province increases 
(Fig. 3A), a pattern driven predominantly by the 
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shallowing of the estimated Cenozoic rate (A) in 
high-latitude provinces (Fig. 3B). Pre-K/Pg rates 
show no such latitudinal trend (Fig. 3C). The 
tropical West Pacific and Arctic provinces show 
the extremes of this contrast (fig. S3): The pre- 
K/Pg slopes of these two provinces differ only 
slightly (many of these older taxa are shared by 
the two regions), but the difference after the K/Pg 
extinction is dramatic. 

The ages used here are the global first ap- 
pearance of a genus (as opposed to the first oc- 
currence of a genus in a given province), so that 
the slope of the provincial curves represents the 
outcome of origination plus invasion minus local 
extinction, rather than simple in situ origination. 
However, the late Cenozoic fossil record con- 
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Fig. 2. Backward sur- 
vivorship curves for the 
27 provincial faunas of 
the present-day shelf 
biota. Plots are on ar- 
ithmetic axes to enable 
visualization at low di- 
versities; vertical red lines 
mark the K/Pg boundary 
(65 Ma). 
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strains these data by showing that marine bivalve 
genera tend to appear first in the tropics, with 
range expansion being overwhelmingly unidi- 
rectional from the tropics to the poles (8, 9, 23). 
The shallowing of Cenozoic age-frequency curves 
from tropics to poles thus appears to reflect the 
decreasing probability for genera to reach and 
remain established in progressively higher lat- 
itudes (9). Therefore, though the end-Cretaceous 
extinction was equally severe in low and high 
latitudes (22), the spatial pattern of origination 
and spread affects the taxonomic structures of 
these regions differently; the post-K/Pg slopes of 
BSCs primarily reflect origination rates in tropi- 
cal provinces and invasion rates in polar prov- 
inces. This view is supported by the contrasting 


Arctic Californian 
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geographic distributions of pre- and post-K/Pg 
genera (Fig. 4). The distribution of post-K/Pg 
genera today forms a hollow curve, with 32% of 
the 638 genera being present exclusively in trop- 
ical or warm-temperate provinces (Fig. 4). Pre- 
K/Pg genera have significantly wider distributions, 
with only 14% of 73 genera being exclusive to 
warm-water provinces (significantly fewer than 
in the post-K/Pg sample, log-likelihood ratio G 
test, P < 0.001), the majority (82%) inhabiting 
both warm- and cold-water regions (Fig. 4). 
Because the present day is our reference time, 
our BSCs are reminiscent of lineage-through- 
time (LTT) plots, which estimate diversity dy- 
namics through time using the divergence times 
of lineages in a phylogenetic tree constructed for 
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living species (24-27). BSCs differ importantly 
from LTTs, however, in that the frequency dis- 
tribution of branch lengths (here the age of the 
genus) is the raw data, rather than the length of 
time between nodes on a phylogenetic tree (28). 
Therefore, although changes in extinction rate 
can influence the slopes of LTTs (24, 28), these 
effects are mathematically cancelled out in BSCs 
(5, 28), given the assumptions noted above. 
Nevertheless, we verified our global results 
by calculating the average per-taxon origination 
rates (29) [termed “per-capita rates” in (29), but 
units are actually per lineage per million years] of 
all genera, living and extinct, within extant bi- 
valve families for 5-My intervals within the post- 
recovery Cenozoic (0 to 50 Ma) and for the 50 
My before the K-Pg boundary. This form of per- 
capita rate, derived from branching theory, is 
robust to variation in the true extinction rate, 
including events as dramatic as mass extinctions 
(29). Focusing on shorter 5-My intervals also 
limits the biasing effect of taxonomic heteroge- 
neity on the estimated rates, so this analysis, 
including extinct genera, makes a useful com- 
parison to the results for living genera alone. 
These per-capita origination rates are remarkably 
close to those determined for the living genera, 
with the rate for the 50 My before the K/Pg at 
0.020 genera per lineage per million years 
(versus 0.014 for the living cohort) and the 


post-K/Pg rate at 0.026 per lineage per million 
years (versus 0.026 for the living cohort), a sta- 
tistically significant increase (Student’s ¢ test, P= 
0.02, normality of data confirmed by Shapiro- 
Wilk test; a nonparametric Wilcoxon signed-rank 
test gives a similar result, P = 0.025) that 
corroborates the shift in origination rate at the 
K/Pg boundary. The per-taxon rate calculations 
do not specifically address the assumptions 
underlying the rates estimated from the BSC of 
the living cohort, but the agreement of the two 
results suggests that the inference of an inflection 
point in the BSC is robust. Spatially explicit LTT 
plots and paleontological cohort analyses will be 
valuable next steps but are difficult to generate 
because local extinction rates can include strong 
sampling effects. 

The sharp increase in the slope of the bivalve 
BSC after the K/Pg event is unlikely to derive 
entirely from improved preservation and sam- 
pling in the younger part of the record and in the 
present day. Our optimizations find the changes 
in slope at the K/Pg boundary and in the early 
Eocene, not in the late Cenozoic when lithifi- 
cation and dissolution impinge least on fossil pre- 
servation (30). Additionally, an analysis restricted 
to bivalves having shell microstructures most 
robust to geochemical dissolution (with low or- 
ganic content, consisting predominantly of calcite) 
and thus producing records least subject to dis- 
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tortion over time (//) shows a pattern consistent 
with that of the entire bivalve cohort (fig. S4). 

These results provide a long-term perspec- 
tive on the consequences of the K/Pg mass 
extinction, with a significant shift in origina- 
tion rate that outlasted the accelerated recov- 
ery phase, and a strong spatial component to 
diversification, such that tropical and warm- 
temperate regions carry the strongest signature 
of post-K/Pg evolution. The lasting increase in 
origination may reflect the removal of com- 
petitors or predators that were damping the 
Mesozoic diversification of the living families; 
alternatively, the Cenozoic diversification of 
shell-penetrating predators, particularly in the 
tropics (3/7, 32), might have increased origination 
rates in their prey during the post-recovery 
interval. These hypotheses need to be tested 
directly; for example, by ecological and func- 
tional analyses of victims and post-extinction 
diversifiers. The declining strength of the appar- 
ent shift in global origination rates with latitude 
underscores how the taxonomic composition of 
high-latitude provinces depends on tropical orig- 
ination rates. The K/Pg extinction was large, but 
its downstream effect has also been important 
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Fig. 4. Present-day geographic ranges of bivalve 
genera that originated (A) after or (B) before the 
K/Pg extinction. Genera originating before the K/Pg 
extinction (B) have significantly greater geographic 
ranges today, determined as the number of bio- 
provinces in which a genus occurs (median range of 
pre-K/Pg genera = 10 provinces, median range of 
post-K/Pg genera = 4 provinces, Kolmogorov- 
Smirnov test, P = 2.16 x 10), and significantly 
fewer exclusively warm-water genera than (A) genera 
originating after the K/Pg. Genera that exist 
exclusively in provinces with tropical or warm- 
temperate climates are denoted in black. All other 
genera have ranges extending into (but not 
necessarily restricted to) cold-temperate or polar 
provinces. The vast majority of such genera (91%) 
also occur in warm-water provinces. 
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and ongoing, so that the long-term shift in orig- 
ination rate, and the spatial pattern of those orig- 
inations, continues to structure the biological 
provinces of the modern world. Despite 65 My of 
evolutionary and environmental change, the mod- 
ern marine biota, globally and regionally, bears the 
evolutionary and biogeographic imprint of the 
K/Pg event. 
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A Great-Appendage Arthropod with a 
Radial Mouth from the Lower 
Devonian Hunsriick Slate, Germany 


Gabriele Kiihl,* Derek E. G. Briggs,” Jes Rust? 


Great-appendage arthropods, characterized by a highly modified anterior limb, were previously 
unknown after the Middle Cambrian. One fossil from the Lower Devonian Hunsriick Slate, Germany, 
extends the stratigraphic range of these arthropods by ~100 million years. Schinderhannes bartelsi 
shows an unusual combination of anomalocaridid and euarthropod characters, including a highly 
specialized swimming appendage. A cladistic analysis indicates that the new taxon is basal to 
crown-group euarthropods and that the great-appendage arthropods are paraphyletic. This 

new fossil shows that features of the anomalocaridids, including the multisegmented raptorial 
appendage and circular plated mouth, persisted long after the initial radiation of the euarthropods. 


range of Cambrian arthropods share a 
Aeenines limb at the front of the head 
referred to as the “great appendage” (/). 
They include the Anomalocarididae and Cam- 
brian euarthropods such as Yohoia, Leanchoilia, 
Jiangfengia, and Fortiforceps and are usually in- 
terpreted as a paraphyletic group within the crown, 
basal to chelicerates, and the great appendage to 
be homologous with the chelicera of living chelic- 
erates (/—-6). Alternatively, they have been consid- 
ered basal to the crown-group euarthropods, in which 
case the great appendage has been lost in living taxa 
(7, 8). Here, we describe a great-appendage arthropod 
from the famous Lower Devonian Hunsriick Slate 
(9). This arthropod shows a previously unseen com- 
bination of the characters that occur in Cambrian 
anomalocaridids and euarthropods. 
Schinderhannes bartelsi, a new genus and spe- 
cies, is a representative of the stem lineage of the 
Euarthropoda. The name Schinderhannes is derived 
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from an 18th-century bandit in the Hunsrtick area 
and bartelsi is a tribute to Christoph Bartels of 
Bochum, Germany, an authority on the Hunsriick 
Slate fossils. The material is a single specimen: PWL 
1994/52-LS is the holotype, located at the Natur- 
historisches Museum Mainz. The specimen has 
been x-radiographed (Fig. 1F), drawn (Fig. 1, B and 
D), and reconstructed (Fig. 1G). It was discovered in 
the Hunsrtick Slate, Germany, at the Eschenbach- 
Bocksberg Quarry in Bundenbach (/0); it was 
found in the Lower Devonian Kaub Formation. 
Diagnosis. The head bears a preoral great ap- 
pendage with at least nine podomeres, a radial mouth, 
and huge eyes. Podomeres three to nine of the great 
appendage bear an elongate lateral spine, and 
podomeres four to nine have a long, ventral, comb- 
like projection. An additional large triangular 
flaplike appendage at the rear of the head is 
thickened along its margin. The trunk consists of 
12 segments: The first 10 have tergites and bear 
biramous appendages, the 11th bears a lateral 
flukelike appendage, and the 12th is apodous. 
The trunk terminates in a long tail spine. 
Description. The total length, including the 
tail spine, is 98 mm. The specimen (Fig. 1, A and 


B) is oriented parallel-oblique to the bedding so 
that the ventral side shows the left eye in outline 
and reveals the outer surface of the left edge of 
the trunk tergites (fig. S1, C and D). Some fea- 
tures, such as the head shield, are obscured by 
incomplete pyritization and phosphatization. The 
outline of the large postoral head appendage has 
been affected by flattening: The left is fore- 
shortened as compared with the right, which shows 
striking evidence of wrinkling. This appendage is 
difficult to interpret without additional examples 
preserved in other attitudes to the bedding. 

The body is divided into head and trunk. The 
head bears a pair of preoral great appendages 
(Fig. 1, B to E, and fig. S1A), a circular mouth struc- 
ture, a pair of huge eyes, and a pair of flaplike 
limbs at the rear. Although there is no clear evi- 
dence of a head shield, the presence of dorsal 
tergites in the trunk region suggests that one was 
present (fig. S1D). 

The great appendage (A1) (Fig. 1, B and E, 
and fig. S1A) consists of up to nine podomeres. 
Podomeres one and two are short and wide, 
whereas podomere three is three times longer and 
bears a lateral spine (Fig. 1, E and G). Podomeres 
four to nine also bear a long inwardly projecting 
lateral spine and a long parallel-sided ventral 
projection that extends at a high angle to the 
appendage (Fig. 1, B, E, and G, and fig. SIA). 
This projection decreases in length from the 
fourth to the ninth podomere. Each one bears a 
series of up to five or six regularly spaced orthog- 
onal spines along its length, decreasing in num- 
ber on the more distal podomeres. A subcircular 
mouth ring (Fig. 1, F and H, and fig. S1B) is 
concealed by the ventral projections. A series of 
small subtriangular features, particularly on the 
left side, may represent tooth plates. Differences 
in width may reflect its attitude to the bedding. 

The stalked eyes are positioned laterally 
behind the great appendages. The left eye in- 
dicates a suboval outline, although it is affected 
by flattening. The eyes are covered in numerous 
tiny, close-packed, hexagonal lenses (fig. S1C). 
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A subcircular darker area between the eyes con- 
sists of apatite, presumably indicating the posi- 
tion of the gut. 

A second highly differentiated appendage (A2) 
is interpreted as belonging to the head (Fig. 1, A 
and B). It is a wide, triangular, flaplike structure 
with a broad proximal area (Fig. 1F). The ridge- 
like appearance of the margins represents thicken- 
ing, and they preserve evidence of short spines 
(Fig. 1B). 

The trunk consists of 12 segments (Fig. 1B). 
The first 10 bear a pair of homogeneous biramous 
appendages. The endopods are flat and made up 


of at least three wide podomeres (fig. S1F). The 
proximal two podomeres are short and similar in 
length; the third is twice as long and terminates in 
several elongations. The exopod consists of an 
unsegmented axis bearing numerous flaps (fig. 
S1, D and E). The appendages correspond in po- 
sition to the tergites. The last two segments of the 
trunk lack biramous appendages. Segment 11 
bears a pair of lateral flukelike limbs with thickened 
outer and posterior margins; they partially over- 
lap segment 12 (Fig. 1, A and B). 

A concentration of apatite along the axis of 
the trunk may reflect the position of the gut. The 


ventral right 


anus is located at the posterior margin of the last 
trunk segment, indicated by a circle of pyrite. The 
trunk terminates in a long unsegmented spine 
with a median keel. 

The spiny great appendage suggests that 
Schinderhannes was a predator [for example, 
(2)]. The large eyes imply a substantial visual 
capability. The flukes at the rear of the trunk 
indicate that swimming was the main mode of 
locomotion. Thrust was presumably generated 
by the large postoral appendage (A2), which 
may have functioned as a paddle or “wing,” as 
was deduced for the lobes in the trunk of 
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Fig. 1. Holotype of Schinderhannes bartelsi. (A) Ventral. (B) Interpretative 
drawing of ventral side. lL, left; r, right; A1, great appendage; A2, flaplike 
appendage; sp, spine; fm, flap margin; te, tergite; ta, trunk appendage. (C) 
Partly exposed dorsal side, horizontally mirrored. (D) Interpretative draw- 


ing of dorsal side. (E) Interpretative drawing of great appendages, com- 
bining information from the dorsal and ventral sides. (F) Radiograph. (G) 
Reconstruction. Scale bar, 10 mm [for (A) to (G)]. (H) Mouth-part. Scale bar, 
5 mm. 
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Anomalocaris (11). Lift was also generated by 
the tail flukes. 

Schinderhannes shares several characters with 
anomalocaridids. The morphology and position of 
the great appendage, which lies at the front of the 
head and anterior of the eyes, are very similar to 
that of “Appendage F” (/2), the frontal appendage 
of Laggania cambria (11, 13). The circular mouth 
is characteristic of anomalocaridids (/3), and flap- 
like appendages similar to that at the rear of the 
head of S. bartelsi occur in the trunk of Anomalo- 
caris canadensis (11), Parapeytoia yunnanensis 
(14), Pambdelurion whittingtoni (15), and Keryg- 
machela kierkegaardi (16). Other trunk features of 
Schinderhannes are characteristic of euarthropods, 
including “short-great-appendage” arthropods. 
These include the tergites, biramous trunk append- 
ages, and tail spine. 


Aysheaia 
Kerygmachela 
Opabinia 
Anomalocaris 
Schinderhannes 
Trilobite 
Canadaspis 


Rehbachiella 


[er 
esoesMg 


Nebalia 
Fuxianhuia 
Fortiforceps 


Leanchoilia 


epodojyyeng 


Jiangfengia 


Yohoia 


spodosyye 
e6epuedde-jeas6-yoys, 


Branchiocaris 
Haliestes 


Palaeopantopus 


eee te) 


Tachypleus 


Eurypterid 


Fig. 2. Cladogram; tree length, 87. Consistency 
index, 0.5402; retention index, 0.6552. (1) Peytoia- 
like mouth sclerites, terminal mouth position, lateral 
lobes, loss of lobopod limbs, and stalked eyes. (2) 
Great appendages. (3) Sclerotized tergites, head 
shield, loss of lateral lobes, and biramous trunk 
appendages. (4) Stalked eyes in front and loss of 
radial mouth. (5) Post-antennal head appendages 
biramous and antenna in first head position. (6) Free 
cephalic carapace, carapace bivalved, and two pairs of 
antennae. (7) Maxilla | and II. (8) Exopods simple oval 
flap. (9) Two pre-oral appendages and a multiseg- 
mented trunk endopod. (10) Post-antennal head 
appendages biramous and tail appendages fringed 
with setae. (11) Long flagellae on great appendage 
and exopods fringed with filaments. (12) Trunk 
appendages uniramous and eyes not stalked. (13) 
No posterior tergites. (14) Tail spines and chelicere/ 
chelifore on first head position. (15) Proboscis. (16) 
Six post-antennal head appendages. 


A cladistic analysis (17) places Schinder- 
hannes between Anomalocaris and other arthro- 
pods (Fig. 2). The short-great-appendage taxa are 
paraphyletic, and the monophyly of a taxon 
“Megacheira”’ (/) was not confirmed. The position 
of the short-great-appendage arthropods as stem 
lineage representatives of the Chelicerata, howev- 
er, is consistent with the majority of recent analy- 
ses (2, 3, 5, 18-22) and supports the interpretation 
of the great appendage as homologous with the 
chelicera of living chelicerates (/, 3—7). 

The discovery of Schinderhannes emphasizes 
the importance of exceptionally preserved de- 
posits (Konservat-Lagerstatten) in revealing the 
evolutionary history of arthropods. It shows that 
features of the giant Cambrian anomalocaridids 
survived for about 100 million years after the 
Middle Cambrian. The Hunsriick Slate also 
yields examples of Marrellomorpha (23), a clade 
well known from the Cambrian (24) and more 
recently discovered in exceptionally preserved 
fossil deposits from the Silurian and the Ordovi- 
cian (25, 26). Thus, the rarity of post-Cambrian 
great-appendage arthropods may be a result in 
part of the decline of Burgess Shale—type preser- 
vation after the Middle Cambrian (27). 
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Two Thresholds, Three Male Forms 
Result in Facultative Male 
Trimorphism in Beetles 


J. Mark Rowland?* and Douglas J. Emlen” 


Male animals of many species deploy conditional reproductive strategies that contain distinct 
alternative phenotypes. Such facultatively expressed male tactics are assumed to be due to a single 
developmental threshold mechanism switching between the expression of two alternative 
phenotypes. However, we discovered a clade of dung beetles that commonly expresses two threshold 
mechanisms, resulting in three alternative phenotypes (male trimorphism). Once recognized, we 
found trimorphism in other beetle families that involves different types of male weapons. Evidence 
that insects assumed to be dimorphic can express three facultative male forms suggests that we need 
to adjust how we think about animal mating systems and the evolution of conditional strategies. 


Iternative reproductive tactics in male 
Asn such as guards versus sneaks or 
callers versus satellites, have served as 
models for examining the evolution of threshold- 


mediated traits (e.g., polyphenisms). Thousands 
of empirical examples of alternative reproductive 
tactics (7), numerous theoretical models of thresh- 
old evolution (2-5), and many examples of poly- 
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phenic regulatory developmental mechanisms 
(/, 6) are known. These studies assume that two 
phenotypic alternatives are separated by a single 
regulatory threshold, which results in a dominant 
tactic used by the largest, healthiest, or highest 
quality males and an alternative, generally less- 
aggressive tactic adopted by subordinate males. 
Models for the evolution of these and other 
threshold traits incorporate a single threshold that 
yields two reproductive tactics. 

A few well-studied animal species contain 
three male tactics (male trimorphism), including 
alpha, beta, and gamma males in isopods (7) and 
fish (8); blue, yellow, and orange males in side- 
blotched lizards (9, 70); and independent, satellite, 
and faeder males in birds (//). In all of these spe- 
cies, male reproductive phenotypes are determined 
primarily by the inheritance of alleles of large ef- 
fect (1.e., the morphs behave like genetic polymor- 
phisms), rather than by facultative mechanisms 
incorporating thresholds. The evolution of male 
trimorphism has yet to be examined in the context 
of conditional alternative tactics and polyphenism. 

Because animal development is rife with 
thresholds and because the expression of complex 
morphological and behavioral phenotypes may 
be sensitive to many environmental or circum- 
stantial stimuli at multiple periods during devel- 
opment (6), there is no reason to expect that a 
single threshold should be the norm. We provide 
examples of beetles that incorporate two devel- 
opmental thresholds into the regulation of expres- 
sion of a single male trait (weapons), yielding 
three male forms (facultative male trimorphism). 

Dung beetles form a monophyletic clade of 
the Scarabaeidae that originated at least 65 
million years ago but radiated extensively with 
mammals (/2). More than 5000 species have 
been described worldwide, and many of these 
bear horns. Beetle horns are rigid outgrowths of 
the exoskeleton that function in intrasexual com- 
bat over access to resources. In most dung bee- 
tles, horns are confined to males, and males of 
many of these species are dimorphic, with dis- 
tinct major and minor phenotypes (/3, /4). Typ- 
ically, males larger than a threshold body size 
(major males) produce large horns. Males smaller 
than this threshold body size (minor males) have 
disproportionally smaller horns or are hornless. 
Male weapon morphology is determined facul- 
tatively, on the basis of an interaction between an 
intrinsic and heritable threshold and the nutrition- 
al history (environment) of the animal, resulting 
in maturation at a body size that is either above or 
below this critical threshold size (74-16). Major 
and minor male dung beetles have been found to 
use alternative reproductive tactics to mate with 
females (/7—19). Major males use their large 
body sizes and long horns to defend entrances to 
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burrows containing females, guarding these tun- 
nels from rival males; minor males sneak into 
these tunnels, either by slipping past the guard- 
ing male or by digging side tunnels that inter- 
cept tunnels beneath the guarding males. Thus, 
dung beetles appear to fit the traditional model 
of conditionally expressed alternative male tactics 
regulated by a single developmental threshold 


However, at least two qualitatively different 
conditional threshold mechanisms are now 
known to regulate horn expression in dung 
beetles. One mechanism (threshold mechanism 
1; Fig. 1A, left) appears to truncate production of 
the horn entirely, either by suppressing prolifer- 
ation in the cells that form the horns (20, 2/) or 
by reabsorbing horn tissue during metamorphosis 


mechanism. (22). Populations incorporating this regulatory 
Threshold mechanism 2 
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Fig. 1. Two threshold mechanisms regulate expression of horns in male dung beetles. (A) Horn 
length/body size scaling relationships for natural populations of adult males of Phanaeus igneus 
(left), P. triangularis (middle), and P. vindex (right) reveal these processes as an abrupt and size- 
dependent change in scaling relationship slope (threshold mechanism 1, left), a size-dependent shift 
in intercept (threshold mechanism 2, right), or a combination of the two mechanisms (middle). 
Because among-individual variation in body size in scarab beetles is influenced primarily by larval 
nutrition, species simultaneously incorporating both of these threshold mechanisms are facultatively 
trimorphic. From this, we define alpha (light blue), beta (dark blue), and gamma (green) male forms, 
which were discriminated by the likelihood method for normal distributions (25). (B) Evolution of two 
thresholds mapped onto a phylogeny of 22 species of phanaeine scarabs [tree topology from (34)], 
with the presence of each threshold mechanism (bold branches) estimated by parsimony (35). 
Species containing alpha, beta, and gamma male forms indicated as above. Facultative trimorphism 
appears to have been gained at least four separate times in the period covered by this phylogeny (red 
boxes). 
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mechanism can be identified by a dramatic switch 
in the slope of the scaling relationship (allometry) 
between horn length and body size. Individuals 
producing horns have horn lengths that scale posi- 
tively and often linearly with among-individual 
variation in body size. Individuals not producing 
horns yield flat scaling relationships when the 
corresponding parts of the head are measured. A 
different mechanism (threshold mechanism 2; 
Fig. 1A, right) modulates the total amount of 
horn growth, resulting in morphologically similar 
but disproportionally shorter horns. Populations 
incorporating this second threshold mechanism 
typically have horn length/body size scaling rela- 


Fig. 2. Male trimor- 
phism in three beetle 
families. (A) In scarabaeid 
dung beetles (Oxysternon 
conspiailatum), alpha and 
beta males both produce 
head horns, but the rela- 
tive sizes of these weapons 
differ (shift in intercept of 
the scaling relationship 
between horn length and 
body size, threshold mech- 
anism 2). Gamma males 
and females lack head 
horns entirely (change in 
scaling relationship slope, 
threshold mechanism 1). 
Alpha, beta, and gamma 
males were discriminated 
by the likelihood method 
for normal distributions 
(25). (B) In lucanid bee- 
tles (Odontolabis cuvera), 
male mandibles occur in 
three discrete anatomical 
conformations, which iden- 
tify the alpha, beta, and 
gamma male forms (al- 
so fig. $2). (C) In weevils 
[Parisoschoenus expositus; 
measures from (26)], males 
produce long ventral spines 
that are outgrowths of 
the sternum and flank a 
deep invagination of cu- 
ticle, the sheath. As with 
the dung beetles, alpha 
and beta males produce 
similar weapons that dif- 
fer in relative size (thresh- 
old mechanism 2), whereas 
gamma males lack weap- 
ons and resemble females. 
In P. expositus, gamma 
males and females lack 
both the ventral spines 
and the sheath (white ar- 
rows) used in male com- 
bat (26). 


tionships with similar (e.g., parallel) slopes but 
shifted y intercepts. Both types of threshold mech- 
anism have been shown to couple horn growth 
with nutrition in scarab beetles and are considered 
facultative (polyphenic) regulatory processes 
(14, 16, 23, 24), and preliminary evidence sug- 
gested that they might sometimes both be present 
in the same species (25). 

We looked at a taxonomic array of phanaeine 
dung beetles to determine whether both threshold 
mechanisms could cooccur within the males of a 
single species. Remarkably, we found that simul- 
taneous incorporation of both threshold mecha- 
nisms was common in these beetles and that male 
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trimorphism resulted from this cooccurrence. We 
were thus able to identify distinct alpha, beta, and 
gamma males within these species (Figs. 1 and 2). 
Additionally, we found that both temporal and 
geographic changes in population body sizes in 
trimorphic species produced changes in the rel- 
ative proportions of the three male morphs, as 
predicted for facultative threshold mechanisms 
(fig. S1, A and B). Our data suggest that facul- 
tative trimorphism evolved through the stepwise 
accumulation of independent threshold mecha- 
nisms (e.g., monomorphism — threshold mech- 
anism 2 — threshold mechanisms 2 + 1; Fig. 
1B). An alternative possibility that we did not test 
is that independent, dimorphic lineages with 
different threshold mechanisms hybridized to 
form trimorphic populations. Data from addi- 
tional species will be needed to test this evolu- 
tionary scenario. 

We observed male trimorphism in 5 of the 21 
species of phanaeine dung beetles studied, with 
striking trimorphism in several (Fig. 1 and 2). 
These results coupled with literature suggesting 
that isolated species of other insects may have 
three male forms (26—29) led us then to look for 
trimorphism in other families of beetles. Here 
again, a limited study identified male trimor- 
phism in the allied family Lucanidae (stag bee- 
tles) and also in the more distantly related 
Curculionidae (weevils). We found that trimor- 
phism is expressed in three different organ 
systems in these three families of beetles: head 
horns in Scarabaeidae, mandibles in Lucanidae, 
and sternal spines in Curculionidae (Fig. 2). It 
now appears that several groups long considered 
to be male dimorphic actually contain trimorphic 
species. 

We detected the occurrence of trimorphism in 
dung beetles by using phylogenetic reconstruction 
of the evolution of each of two developmental 
threshold mechanisms, scoring trimorphic taxa 
as having both mechanisms. Trimorphism in sev- 
eral of these species was evident in the nonlinear 
allometry of their head horns. However, none of 
the statistical methods used to detect thresholds 
(13, 25, 30, 31) was able to consistently identify 
more than a single threshold in these taxa, re- 
flecting the fact that they were designed to test for 
dimorphism, not trimorphism. In addition, we 
showed that trimorphism can occur in other groups 
of beetles without dramatic or even detectable 
inflections in allometry. For example, in lucanids 
allometry of mandible scaling alone reveals two 
male trajectories, yet we found that at least some 
lucanid species express three qualitatively dif- 
ferent male phenotypes on the basis of arma- 
ture of the mandibles (Fig. 2B and fig. S2). 
This suggests that in some taxa gamma male 
morphs will be difficult to detect with current 
methods and raises the possibility that gamma 
males are present but as yet undetected in many 
animal taxa. 

How are three male forms maintained in these 
populations? Studies of allelically trimorphic 
taxa may be especially informative because al- 
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pha, beta, and gamma males are well known to 
use discrete alternative mating behaviors (7—/0). 
In these species, negative frequency-dependent 
selection with intransitive fitness interactions 
among tactics (e.g., rock-paper-scissors) ap- 
pear to maintain trimorphism within populations 
(9, 10, 32). It is unknown how male reproductive 
behavior varies in species with facultative tri- 
morphism, and it will be important to determine 
whether similar fitness intransitivity of tactics 
applies in these cases. It is interesting that the 
smallest male forms (gamma males), at least in 
the beetle families studied so far, invariably re- 
semble females. This suggests that male repro- 
ductive tactics may include a dominant (fight/ 
guard) tactic, a subordinate (sneak) tactic, and a 
female-mimicry tactic. This is a striking parallel 
to allelically trimorphic fish (8), isopods (7), and 
birds (//) and would support predictions from 
recent rock-paper-scissors models that suggest 
that many taxa will contain cryptic (undetected) 
female-mimicking males (/0, 32). 

Recognizing that there are at least two distinct 
thresholds also affects studies of the evolution of 
these developmental mechanisms. It is already 
clear that there are many developmental routes to 
the polyphenic regulation of male weapon sys- 
tems in beetles, and, as these mechanisms are 
discovered and described, they are routinely 
compared across species (20-22, 33). We suggest 
that the most meaningful comparisons will be 
those that explicitly consider the type of threshold 
mechanism involved and treat these accordingly 
as distinct and evolutionarily independent pro- 
cesses. Our findings raise the possibility that 
horned male majors in species with threshold 


mechanism | (Fig. 1A, left) are actually, devel- 
opmentally, beta males, whereas the horned 
majors in species with threshold mechanism 2 
(Fig. 1A, right) are alpha males. Acknowledging 
this distinction can help us better understand the 
full complexity of their rich behavioral reper- 
toires, as well as more appropriately study the 
developmental and genetic architectures of their 
facultative regulatory mechanisms. 
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Sequential Sympatric Speciation 
Across Trophic Levels 


Andrew A. Forbes,?* Thomas H.Q. Powell,” Lukasz L. Stelinski,? James J. Smith,? Jeffrey L. Feder? 


A major cause for biodiversity may be biodiversity itself. As new species form, they may create 
new niches for others to exploit, potentially catalyzing a chain reaction of speciation events across 
trophic levels. We tested for such sequential radiation in the Rhagoletis pomonella (Diptera: 
Tephritidae) complex, a model for sympatric speciation via host plant shifting. We report that the 
parasitic wasp Diachasma alloeum (Hymenoptera: Braconidae) has formed new incipient species as 
a result of specializing on diversifying fly hosts, including the recently derived apple-infesting race 
of R. pomonella. Furthermore, we show that traits that differentially adapt R. pomonella flies to 
their host plants have also quickly evolved and serve as ecological barriers to reproduction, 
isolating the wasps. Speciation therefore cascades as the effects of new niche construction 


move across trophic levels. 


been inferred to explain current and past 

patterns of biodiversity by paleontologists, 
ecologists, and evolutionary biologists alike (/—3). 
However, this hypothesis of sequential radiation 
is difficult to directly test in nature. Examples such 
as adaptive radiations after mass extinctions (4), 
species richness in the tropics (/), and the in- 
creased diversity of insects having herbivorous 


T= idea that species induce speciation has 


life styles (5, 6) have mainly been investigated on 
the basis of phylogenetic inference and/or correl- 
ative analyses. 

Host plant—specific phytophagous insects and 
their parasites may be good candidates for testing 
the sequential radiation hypothesis (7). This is be- 
cause new resource opportunities become available 
when a plant-eating insect diversifies by shifting 
and adapting to a novel host plant, with its guild 


of associated parasites potentially following suit 
and speciating in kind. Unfortunately, a lack of 
historical and biogeographic information concern- 
ing host shifting and the absence of a free-living 
parasite life stage often complicate our under- 
standing of plant-insect-parasite systems. In these 
cases, cocladogenesis (cospeciation resulting from 
parallel allopatry of interacting organisms) rather 
than the cascading effects of shifting host ecol- 
ogy across trophic levels could trigger codiver- 
sification (8). One cannot rule out that insect 
and parasitoids became separated in tandem 
from other conspecifics and evolved into new 
species as a consequence of their shared physical 
isolation. 
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The Rhagoletis pomonella sibling species 
complex is a model for speciation in the absence 
of geographic isolation via host plant shifting. 
The natural history of these flies, including the 
recent sympatric host shift of the species R. 
pomonella from hawthorn to introduced, do- 
mesticated apple within the past 150 years (9), 
suggests that they did not evolve as a result of 
passive cocladogenesis. Therefore, we tested for 
sequential radiation of its specialist parasitoid wasp, 
Diachasma alloeum [see supporting online ma- 
terial (SOM) text for parallels between fly and wasp 
biology], by examining whether wasps attacking 
the ancestral hawthorn and the derived apple races 
of R. pomonella, as well as the closely related sib- 
ling species R. mendax (host: blueberry, Vaccinium 
spp.) and R. zephyria (snowberry, Symphoricarpos 
spp.), display patterns of host-related genetic varia- 
tion similar to those of the flies. D. alloeum only 
attacks R. pomonella complex flies found on these 
four host plants (0). We also investigated whether 
wasps differed by the same type of host plant— 
specific mating and diapause traits that are ecolog- 
ically isolating the flies. 

The pattern of genetic differentiation among 
D. alloeum populations was similar to that for 
R. pomonella flies (11, 12). Mitochondrial DNA 
(mtDNA) cytochrome oxidase I (COD) sequences 
displayed only modest host-related differentiation 
for the wasps (Fig. 1A, figs. S1 and S2, and tables 
S1 and S2). Therefore, the wasps are apparently 
of relatively recent origin and do not represent 
highly genetically diverged cryptic sibling spe- 
cies. However, we did find a mtDNA haplotype 
in snowberry wasps that was not found in the 
other wasp populations (fig. S2 and table S2). In 
addition, a common mtDNA haplotype in apple, 
hawthorn, and blueberry wasps was not present 
in the snowberry population. Thus, in a mtDNA 
genetic distance network, the snowberry wasp 
was offset from the other taxa (Fig. 1A). 

Nine of 21 microsatellite loci analyzed (DA003, 
DA013, DAO19, DA150, DA174, DA183, DA192, 
DA202, and DA205) displayed consistent host- 
related allele frequency differences between 
at least two of the wasp populations (table S3 
and SOM appendix), much like allozymes for 
R. pomonella flies (13, 14). Locus DA003 showed 
particularly pronounced frequency differences (see 
Fig. 2 for graph of combined DA003 196+200+204 
allele frequencies). Indeed, allele 196 was common 
in apple (mean frequency = 0.292; n = 291 total 
alleles scored), blueberry (0.179, n = 246), and 
snowberry (0.257, n = 101) wasp populations but 
absent from all hawthorn wasps (1 = 385 alleles 
scored; SOM appendix). The lack of allele 196 in 
hawthorn wasps for locus DA003 suggests that 
there is little or no effective interhost gene flow from 
apple, blueberry, or snowberry wasp populations into 
the hawthorn population. Neighbor-joining trees for 
the nine loci displaying host-related divergence 
(Fig. 1B) and the full 21-loci data set (fig. S3) sep- 
arated hawthorn and blueberry wasp populations at 
different ends of the networks. The lone snowberry 
wasp site analyzed was genetically most closely as- 


sociated with blueberry wasps, whereas the apple 
wasps were placed at an intermediate position 
between blueberry and hawthorn populations 
(Fig. 1B). 

We next investigated whether the same host- 
related adaptations ecologically isolating R. pomonella 
flies were responsible for genetically differentiat- 
ing the wasps. Because Rhagoletis flies mate on 
or near the fruit of their respective host plants 
(15-17), host choice generates prezygotic isola- 
tion and facilitates the evolution of host-specific 
performance traits. A field study of free-living adult 
D. alloeum in a blueberry patch in Fennville, MI, 
suggested that the wasps also use host fruit as a 
rendezvous site for mating. Before copulation, all 
male and female wasps that formed the 24 mating 
pairs recorded in the study were first observed flying 
toward blueberries from distances of 1 to 3 m away. 
In all cases, wasps flew close to blueberries, some- 
times making contact with the fiuit, before they 
initiated coupling on nearby host leaves [mean 
distance mating from fruit = 7.4 + 6.3 cm (SD), n = 
24, range = 3 to 30 cm]. The average time from first 
observation to mating was 172.2 + 94.7 s SD. 

Rhagoletis flies use the volatile compounds 
emitted from the surface of ripening fruit as ol- 
factory cues to both find and discriminate among 
host plants for mating and oviposition (/8). We 
thus tested whether wasps show similar discrim- 
inatory behavior for surface host fruit volatiles by 
using a y-tube olfactometer (fig. S4). We found 
that naive adult wasps from apple, hawthorn, and 
blueberry populations positively oriented to the arm 
of the y-tube containing their natal fruit odor and 
were antagonized by nonnatal volatiles (Table 1). 
There was no qualitative difference between males 


A mtDNA COI 


Haw Blue 


REPORTS 


and females in their behavioral responses to sur- 
face fruit volatiles. However, female hawthorn 
wasps displayed stronger preferences for natal 
hawthorn fruit volatiles (Fisher’s exact test, P = 
0.006, 1 df) and more pronounced antagonism to 
nonnatal apple volatiles (P = 0.030, 1 df) than 
male hawthorn wasps did. Snowberry wasps 
showed behavioral specificity for their natal fruit 
volatiles that was similar to that of the other para- 
sites, but too few snowberry wasps were available 
for testing to draw definitive conclusions (Table 1). 
All wasps exhibited behavioral antagonism to the 
volatiles of flowering dogwood fruit (Cornus florida), 
a host for an undescribed sister taxon to R. pomonella 
that D. alloeum does not attack (9). These data 
suggest that fruit odor discrimination may act in a 
similar manner in the wasps as in the flies and 
may generate host-specific mating, resulting in 
an ecological barrier to gene flow in sympatry. 
Because we cannot mate or rear D. alloeum for 
multiple generations in the laboratory, it is unknown 
whether maternal effects and/or larval condition- 
ing may contribute to genetically based fruit vol- 
atile discrimination. In the rare instances in which 
larval experience was inferred to influence an 
adult insect’s preference, the effect was mediated 
via direct larval contact with plant compounds (20). 
But, because D. alloeum has no direct contact with 
surface fruit volatiles at any stage of its life before 
adult eclosion, larval conditioning is unlikely. 
Diapause life-history differences represent a 
second critical ecological barrier to gene flow 
among Rhagoletis flies (21). Sympatric blueberry, 
apple, and hawthorn flies all eclose as adults at 
different times in the spring and/or summer, match- 
ing the timing of fruit ripeness on their respective 
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Fig. 1. Genetic distance networks depicting relationships of D. alloeum wasp populations attacking 
blueberry (blue), apple (green), hawthorn (red), and snowberry flies (black). (A) Fitch network created on 
the basis of Nei genetic distances of mtDNA haplotype frequencies in wasp populations. (B) Neighbor- 
joining Nei genetic distance network of the nine microsatellite loci showing consistent site-to-site host- 
related frequency differences. Black ovals give bootstrap confidence values (10,000 replicates). Site 
abbreviations are Migg = Grant, MI; MI, = Holland, Ml; Mi;, = East Lansing, Ml; Mley = Fennville, 
MI; Mipw = Dowagiac, Ml; ILyp = Urbana, IL; PAsc = State College, PA; NJcy = Chatsworth, NJ. See fig. $1 
and table $1 for a map and descriptions, respectively, of study sites. 
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host plants (Fig. 3). Because Rhagoletis flies have 
only one generation per year and live for about a 
month in the field, the differences in the timing of 
eclosion can allochronically isolate the flies. Lab- 
oratory rearing experiments and field studies of 
D. alloeum indicated that similar eclosion time 
differences exist among blueberry, apple, and 
hawthorn wasps (Fig. 3 and table S4). The lone 
exception was a late fruiting apple from the 
Dowagiac, MI site where apple wasps had a rel- 
atively late mean eclosion time of 112.9 + 1.01 
days (SE) (n = 102) that overlapped with that of 
hawthorn wasps (111.1 + 1.13 days, n = 44). 
Mean longevity of adult D. alloeum was deter- 
mined to be less than 2 weeks in the laboratory 
(12.9 + 0.4 days SE, n = 286). Assuming wasps 
have a similar lifespan in nature (a likely over- 
estimate), eclosion time differences may decrease 


seasonal overlap and mating opportunities be- 
tween blueberry and apple wasps at the sympatric 
Fennville, MI site by 28.9%, between apple and 
hawthorn wasps by 29.8%, and between blue- 
berry and hawthorn wasps by 74.5%. 

For R. pomonella, the allozymes displaying 
host-related frequency differences correlate with 
eclosion time (9), tying together the genetics of 
host race formation with a trait that allochroni- 
cally isolates the flies. We tested for similar rela- 
tionships for the microsatellite loci separately in 
male and female wasps because of sex-related 
differences in eclosion time (male wasps eclose 
several days earlier than females). For D. alloeum, 
the microsatellite loci were significantly corre- 
lated with variation in eclosion time for hawthorn 
(r° stepwise multiple regression for females = 
0.504, P < 0.0001, 52 df; r* males = 0.525, P< 
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Fig. 2. Combined 196+200+204 allele frequencies for locus DAOO3 for D. alloeum wasp populations 


attacking blueberry (blue sites), apple (green sites), 
United States. n is number of alleles scored for site. 


and hawthorn (red sites) arrayed by latitude in the 
See Fig. 1 legend for site abbreviations. 


Table 1. Percentages of increase (+ indicates preference) or decrease (—, antagonism or avoidance) in the 
behavioral orientation of populations of D. alloeum wasps to the arm of a y-tube olfactometer containing 
the indicated host fruit volatiles compared with blank control experiments. Control values (bottom row) 
denote the percentage of wasps orienting to either the right or the left arm of the y-tube in experiments 
when no odor was present in both arms. There was no directional preference for either arm of the y-tube in 
the blank control runs, and the remaining ~23% of wasps that did not orient right or left failed to move up 
the y-tube from their release point. “P < 0.01, ~ P < 0.001, and ‘not significant. 


Host fruit Apple wasps Blueberry Hawthorn Snowberry 
volatiles n=201 ee ia were 
n=110 n = 330 n=9 
Apple 454% 29% -18%" —14%' 
Blueberry 23% 442%” -28% —14%' 
Hawthorn 33% 34%” 444%” —14%' 
Snowberry 46% —11%' 17%” 0%" 
Dogwood 36% 53% 16%" 14%" 
Control 38% 39% 38% 39% 
6 FEBRUARY 2009 VOL 323 SCIENCE 


0.0001, 46 df), apple (r* females = 0.163, P = 
0.0097, 54 df 7° males = 0.657, P< 0.0001, 30 df), 
and blueberry (* females = 0.686, P < 0.0001, 
29 df 7? males = 0.511, P< 0.0001, 53 df; table S5) 
wasps. In particular, the locus DA003 was sig- 
nificantly correlated with eclosion time variation 
in blueberry wasps (* females = 0.378, P = 
0.0003, 29 df; * males = 0.511, P< 0.0001, 53 df) 
but only moderately in female apple wasps (77 
females = 0.101, P=0.0182, 54 df) and was not a 
significant predictor for hawthorn wasps. The 
alleles 196+200+204 at locus DA003 were asso- 
ciated with earlier eclosion times in both female 
and male blueberry and apple wasps (fig. SS). 
These three alleles were present in highest fre- 
quency for blueberry wasps (0.790, 1 = 205 total 
alleles scored), intermediate for apple (0.485, n = 
260), and lowest for hawthorn wasps (0.029, n = 
385), corresponding to the order of eclosion from 
earliest to latest for these populations (Fig. 3). 
Locus DA003 therefore may represent a naturally 
segregating, major effect quantitative trait locus 
for diapause that is associated with ecological repro- 
ductive isolation among D. alloeum populations. 
Our results imply that sympatric host shifts of 
R. pomonella onto new plants may have triggered 
a reciprocal and rapid starburst of adaptive radia- 
tion for its parasitoid, D. alloeum. We found that 
host-related ecological effects initiating specia- 
tion for Rhagoletis rippled through the commu- 
nity and may have amplified diversity for the wasp. 
Although we were unable to unambiguously re- 
solve the source of the recently formed apple wasp 
race in this study, the absence of the 196 allele at 
microsatellite locus DA003 in the hawthorn wasp 
race suggests that the hawthorn wasp race is not 
the sole progenitor of the apple wasps. Together 
with the mtDNA data, allele 196 and its association 
with early eclosion instead suggest that blueberry 
wasps may have given rise to the apple wasp pop- 
ulation. It is possible, however, that apple wasps 
are hybrids resulting from crosses between haw- 
thorn and blueberry wasps on the basis of both their 
intermediate position in the microsatellite distance 
network and their intermediate overlapping eclo- 
sion time. Regardless, our study demonstrates that 
the origin of the apple wasp was not a result of strict 
1:1 cocladogenesis with apple-infesting Rhagoletis 
flies. One intriguing possibility is that the wasps 
themselves may be promoting Rhagoletis host 
shifts. Lower parasitism rates for R. pomonella in 
derived hosts like apple [i.e., enemy-free space (22)] 
could favor race formation for the fly, creating new 
niches promoting wasp divergence. We must also 
determine whether non-host-related premating and 
intrinsic postzygotic isolation exist in the genus 
Diachasma. The sequential radiation hypothesis 
predicts that if nonecological barriers exist they 
should arise after host-related barriers evolve. 
We have presented one case study supporting 
sequential speciation. Other studies have identified 
potential examples (7), and, because D. alloeum 
is just one member of a guild of parasitic braco- 
nids attacking R. pomonella, it is possible that 
sympatric host races of Diachasmimorpha mellea 


www.sciencemag.org 


Downloaded from www.sciencemag.org on February 6, 2009 


CCUG Wry, 
e 


—#— Blueberry Flies 
X=56.7+1.0; n=30 

—eo— Apple Flies 
X=68.840.3; n=1335 

—**— Hawthom Flies 
X=71.540.6; n=283 


—o— Blueberry Wasps 
X=83.9+ 1.5; n=18 

—o— Apple Wasps 
X=96.6+ 1.6; n=41 

—v— Havwthom Wasps 
X=109.9+ 1.6; n=33 


Cumulative Frequency of Eclosion 


Me, Vv 
avvoFINUT TOON TTINTEEEEEETEETEEELT AE @ SSDNYNNYN NU 


No. of Days to Eclosion following 
Post-Winter Heating 


Fig. 3. Cumulative eclosion curves for blueberry-, apple-, and hawthorn-infesting Rhagoletis flies (solid shapes) 
and their associated D. alloeum wasp parasites (open shapes) at a sympatric field site near Fennville, MI. Mean 
times to adult emergence after overwintering (in days) and sample sizes (n) are given in the figure key. 
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Evolution of the Drosophila Nuclear 
Pore Complex Results in Multiple 
Hybrid Incompatibilities 


Speciation often involves the evolution of incompatible gene interactions that cause sterility or lethality 
in hybrids between populations. These so-called hybrid incompatibilities occur between two or more 
functionally divergent loci. We show that the nucleoporin 160kDa (Nup160) gene of the fruitfly 
Drosophila simulans is incompatible with one or more factors on the D. melanogaster X chromosome, 
causing hybrid lethality. Nup160 encodes a nuclear pore complex protein and shows evidence of 
adaptive evolution. Furthermore, the protein encoded by Nup160 directly interacts with that of another 
hybrid lethality gene, Nup96, indicating that at least two lethal hybrid incompatibility genes have 
evolved as byproducts of divergent coevolution among interacting components of the Drosophila 
nuclear pore complex. 


s species diverge from one another, they — ground but, when brought together in hybrids, can 
accumulate genetic substitutions that disrupt gametogenesis (causing hybrid sterility) 
function within their own genomic back- _ or development (causing hybrid lethality) (7, 2). 


The evolution and genetics of these hybrid incom- 
patibilities have been shown to follow specific 
rules. For instance, hybrid incompatibilities tend 
to accumulate gradually as species diverge (3); 
behave as partial recessives in hybrids (4); follow 
Haldane’s rule [that is, the preferential sterility or 
inviability of hybrids of the heterogametic (XY or 
ZW) sex (5)]; and accumulate disproportionately 
on the X chromosome (that is, the large X-effect) 
(6, 7). The molecular biology of hybrid incom- 
patibilities has revealed that five of six hybrid 
incompatibility genes identified so far show sig- 
natures of recurrent adaptive evolution (8—/3), 
which suggests that hybrid sterility and inviability 
generally evolve as incidental byproducts of posi- 
tive natural selection. 

To test whether this emerging molecular rule 
of speciation holds for additional hybrid incom- 
patibility loci, we performed a genetic screen 
for lethal hybrid incompatibilities between 
Drosophila melanogaster and D. simulans, two 
species that diverged ~3 million years ago. Be- 
cause all hybrids between D. melanogaster and 
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D. simulans are sterile, we performed a chromo- 
somal deletion (deficiency) screen to identify 
recessive lethal hybrid-incompatibility factors in 
the D. simulans autosomal genome in the F, 
generation (Fig. 1A) (/4). By screening ~70% 
of the D. simulans autosomal genome with ~200 
deficiencies, we identified 20 small regions that 
cause hybrid lethality when combined with a 
hemizygous D. melanogaster X chromosome 
(14, 15). In one of these 20 regions, we mapped a 
new hybrid lethality gene using chromosomal 
deficiencies from D. melanogaster to a region 
that includes 21 genes (Fig. 1B, and Table 1, lines 
1 to 5) and then performed single-locus com- 
plementation tests (table S1). Only one lesion, 
PBac{RB}R(C338""*, uncovers hybrid lethality 
(Table 1, line 6). To confirm that hybrid lethality 
is uncovered by the PBac{RB}R(C38°""™ mu- 
tation (a piggyBac transposon insertion) and not 
an unidentified linked mutation, we generated re- 
vertant chromosomes that were genetically identical 
to the original PBac{RB}R{C33°""* chromosome, 
except that the piggyBac insertion is precisely ex- 
cised (fig. S1). We recovered eight independent 
revertant chromosomes that lacked the piggyBac 
insertion (PBac{RB}R{C38°""""-R1 to -R8), and 
all were viable in hybrid males (table S2), con- 
firming that the piggyBac insertion unmasks 
hybrid lethality. 

To determine which gene (or genes) is dis- 
rupted by PBac{RB}R{C38°"""", we analyzed 
the genomic flanking sequence of the insertion 
(6) and found that the overlapping 3’ untrans- 
lated regions of two genes, CG4738 and RfC38, 
are disrupted (Fig. 1B). Of these two genes, 
CG4738 is the stronger candidate on the basis of 
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three observations: DNA sequences are more di- 
verged between species at CG4738 than at Rf/C38 
(58 fixed replacement differences versus 1, re- 
spectively); disruption of R{C38”” is not gener- 
ally associated with hybrid lethality because two 
other transposable element insertions that disrupt 
R{C38 but not CG4738 (PflacW}RIC38"3” 
and PBac{WH}R{C38”'”7) do not unmask hybrid 
lethality (table S1, lines 2 and 9); and CG4738 
encodes the predicted protein sequence of the Dro- 
sophila homolog of nucleoporin 160kDa (NUP160), 
a protein component of the nuclear pore complex. 
The latter finding is especially intriguing because 
we previously found that another nucleoporin 
gene, Nup96, causes hybrid lethality between these 
species (//). 

Although compelling, all three lines of evi- 
dence are circumstantial. We reasoned that if the 
D. simulans allele, Nup160%", causes recessive 
hybrid lethality, then introduction of the D. mel- 
anogaster allele, Nup160"", should rescue other- 
wise inviable hybrid males. To test this hypothesis, 
we cloned the Nup160"" cDNA into a vector 
that carries a mini-white* (w') eye-color marker 
(conferring red eyes) and a GAL4-inducible up- 
stream activation sequence (UAS) promoter (here- 
after, we refer to this construct as P/UAS-Nup160""}) 
(15). We transformed P{UAS-Nup160"”} into D. 
simulans w; Lhr flies and recovered four inde- 
pendent stocks with autosomal insertions of the 
P{UAS-Nup160"""} construct. We then crossed 
D. melanogaster PBac{RB}R{C38""""/CyO; 
P{Tub-Gal4}/TM3, Ser females carrying a ubiq- 
uitously expressed GAL4-driver of P{UAS- 
Nup160""'} expression with D. simulans w; Lhr 
P{UAS-Nup160"} males. These crosses produce 
four male zygotic genotypes in equal proportions 
(1:1:1:1), all of which inherited the P/UAS- 
Nup160"'? construct. As expected, the two kinds 
of hybrid males inheriting CyO are viable (Table 2, 


line 1), and hybrid males inheriting the PBac {RB} 
RFC38°""™ insertion but no induced Nupl60"” 
expression (that is, no P/Tub-Gal4}) were lethal 
(Table 2, line 1). However, in contrast to their 
PBac{RB}R{C380"""/sim; TM3/sim_ brothers, 
hybrid males inheriting the PBac{RB}R(C38°0"" 
insertion and induced Nup/ oon! expression, 
PBac{RB }R{C38" sim, P {Tub-Gal4 sim, were 
viable (Table 2, line 1). Moreover, the ratio of the 
three viable classes of hybrid males is not sig- 
nificantly different from 1:1:1 (y?=3.69, P=0.158). 
These data show that ubiquitous expression of 
D. melanogaster Nup160 rescues the hybrid lethal- 
ity caused by the insertion of PBac{RB}R(C38"™, 
and that D. simulans Nup160 is a lethal hybrid- 
incompatibility gene. 

To confirm that Nup/60"” is incompatible with 
one or more factors on the D. melanogaster X 
chromosome, we switched the species origin of the 
X in hybrids. We constructed a D. melanogaster 
C(DA, yIY; PBac{RB }R{C35"”"'/CyO stock, which 
possesses two X™*! chromosomes fused to a sin- 
gle centromere that are transmitted together, and 
crossed females from this stock with D. simulans w; 
Lhr males. These crosses produce progeny with the 
same autosomal genotypes (and cytoplasm) as 
those in Fig. 1A, except that hybrid males inherit 
their X from D. simulans, whereas hybrid females 
inherit both X chromosomes from D. melanogaster 
(fig. $2). [Attached-X™" hybrid females also in- 
herit a Y"™ chromosome, but the Y does not de- 
termine sex in Drosophila and is not essential for 
viability in these species (/7).] These crosses 
show that, whereas X”"°"/Y¥"" PBac{RB}R{C33 0" 
sim hybrid males are lethal (Table 1, line 6), ¥”"/ 
Y"". PBac{RB}R{C38"""*/sim_ hybrid males 
are viable (Table 1, line 7; Fisher’s exact test, 
P < 10°). We also found that although 1" 
X°" PBac{RB}R{(C38 “sim hybrid females 
are viable (Table 1, line 6), attached-X""!, C(1)A, 


Fig. 1. (A) Deficienyy A mel = x —sim__Lhr B 
mapping of recessive X- |. —— ° " aR: +7 
autosome hybrid-lethal cyO ae Lar EBs iH 
interactions in F, hybrids | | | | | | | 
between D. melanogas- E IlFI-ll-IDI-l- 
ter (mel, gray) and D. 
simulans (sim, white). seg — = — ] oe 
The sex chromosomes 
(left, Y chromosome with ee a 
hook) and one representa- —-—— ——| : : 
EEE 


tive pair of autosomes 


(right) are shown, where Df is the autosomal deficiency chromosome 
and CyO is the balancer chromosome. D. melanogaster Df/CyO 
females were crossed with D. simulans Lhr males; Lhr is a mutation 
that rescues ordinarily lethal hybrid males. Of the four zygotic 
genotypes produced, only Df-bearing hybrid males are forced to 
develop using only D. simulans material for the autosomal region 
exposed by the deficiency and only D. melanogaster material for the 
X chromosome. Thus, if only Df-bearing hybrid males die, we infer 


11070k pet pet 11100k A110 ii. 11140k pect 
coLsois Ast-C CG14920CC16854 = Ge-1 1¢ 2>get Gr32a co1s921 RFC3B G31867 006287 
So | c B| Cc i= a a a Sa3 
ce14915 068705 co6is2 CG6201 066230 Csl¢ co4751 
. C > ] 1] < - | Ed 


that a D. simulans autosomal region (red) is incompatible with D. 


melanogaster X chromosome, causing lethality [for more details, see (24)]. The 
hybrid-lethal interactions identified in this screen would normally afflict 
advanced-generation (for example, F2) but not F, hybrids because both 
partners in these X-autosome incompatibilities are recessive. (B) Fine-scale 
mapping results in region 32D on chromosome arm 2L. Three deficiencies fail 


O73 
SS> 


to complement hybrid lethality in males (black bars): Df(2L)FCK-20, Df(2L) 
Exel6027, and Df(2L)BSC36; two deficiencies complement hybrid lethality in 
males (gray bars): Df(2L)Exel6028 and Df(2L)Exel6029. Together, these 
deficiencies implicate one of 21 genes in hybrid lethality that maps to a 
89.6-kb region in the cytological interval 32D02;32D05. 
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y/¥", PBac{RB }R{C35 “sim hybrid females 
are lethal (Table 1, line 7; Fisher’s exact test, P< 
10 °°). These results show, respectively, that 
Nup160°"" causes hybrid lethality via an incom- 
patible epistatic interaction with one or more fac- 
tors on the D. melanogaster X chromosome and 
that Nup160°""-dependent hybrid lethality occurs 
in both sexes. 

To study the evolution of Nup160, we sur- 
veyed DNA sequence polymorphism and diver- 
gence within and between D. melanogaster and 
D. simulans from 12 isofemale lines of each spe- 
cies collected from Zimbabwe (/5). We tested the 
neutral mutation-drift model of protein evolution 
with the McDonald-Kreitman (MK) test (/8). 
The MK test rejects the neutral hypothesis for 
Nup160 and instead reveals a highly significant 
excess of fixed replacement differences between 
species (Table 3, line 1), consistent with recurrent 
adaptive protein evolution. To distinguish whether 
adaptive evolution at Nup/60 occurred during the 
history of one or both species, we polarized substi- 
tutions along lineages by using sequence data from 
the outgroup species, D. yakuba, and contrasted 
polymorphic and fixed mutations in the D. melano- 
gaster and D. simulans lineages separately. These 
lineage-specific MK tests showed that Nup/60 has 
experienced adaptive evolution in both lineages 
(Table 3, lines 2 and 3). 

Despite its history of recurrent positive selec- 
tion, we found no evidence for a recent selective 
sweep at Nup/60 in either the D. melanogaster or 
D. simulans \ineages. Selective sweeps should 
reduce local nucleotide diversity (79, 20) and shift 
the distribution of allele frequencies toward an 
excess of both rare variants (2/) and high- 
frequency derived variants (22). We found that 
mean silent nucleotide diversity, 1, at Nup/60 is 
similar to other autosomal loci sampled from 
African populations of these species (23, 24): 1 = 


0.0160 in D. melanogaster, and m = 0.0335 in 
D. simulans. Moreover, coalescent simulations 
show that neither Tajima’s D (2/) nor Fay and 
Wu’s H (22), two summaries of the allele frequency 
spectrum, deviate significantly from standard neu- 
tral equilibrium expectations in either species (D. 
melanogaster, D = -0.511 and H = —14.273; D. 
simulans, D = 0.134 and H = 7.455; P> 0.124 in all 
tests). These findings suggest that recurrent positive 
selection drove sequence evolution at Nup/60, but 
that none of the selected substitutions occurred re- 
cently (that is, within the past ~0.1 N.. generations, 
where JV, is the effective population size). 

Our current and previous (//) findings show 
that two rapidly evolving autosomal genes from 
D. simulans, Nup160"" and Nup96"”, are incom- 
patible with one or more factors on the D. melano- 
gaster X chromosome. NUP160 and NUP96 are 
members of the NUP107 subcomplex, a subset 
of interacting nucleoporins that together form a 
stable architectural component of the nuclear pore 
complex (NPC), the macromolecular structures that 
mediate all molecular traffic between the nucleus 
and cytoplasm (25). The structure, function, and in- 
teractions among particular nucleoporins seem 
largely conserved among eukaryotes (26). In addi- 
tion to NPC functions, members of the NUP107 
subcomplex also contribute to kinetochore function 
(27), cell cycle progression (28), and dosage com- 
pensation in Drosophila males (29). In D. melano- 
gaster, three other Nup107 subcomplex genes 
(including Nup96) have histories of adaptive evo- 
lution, similar to that observed here for Nup160, 
whereas in D. simulans six others have histories of 
adaptive evolution (30). The components of the 
dosage compensation complex (3/, 32) and its X 
chromosome-binding sites have also evolved by 
positive selection between D. melanogaster and 
D. simulans (33). However, it is unlikely that the 
rapid evolution of Nup160, and other members 


Table 1. Interspecific complementation mapping of a hybrid lethal factor on D. simulans 


chromosome arm 2L. 


D. melanogaster maternal genotype Cytological breakpoints 


Hybrid females Hybrid males 


-/sin CyOlsim -—/sim CyO/sim 


Df(2L)FCK-20/CyO 32D1;32F1-3 471 504 0 88 

Df(2L)Exel6027/CyO 32D2;32D5 338 296 1 261 
Df(2L)BSC36/CyO 32D1;32F3 134 137 0 76 

Df(2L)Exel6028/CyO 32D5;32E4 204 183 151-127 
Df(2L)Exel6029/CyO 32E4;32F2 265 274 254 ~=192 
PBac{RB}R(C38 0" °*/CyO 32D4 359 306 ) 219 
C(DA, y; PBac{RB}R(C338°°"”°*/CyO 32D4 ) 93 114 120 


REPORTS 


of the Nup107 subcomplex, reflects coevolution 
with the dosage compensation pathway or that 
hybrid inviability involves a disruption of dosage 
compensation in hybrids. For one, the rapid evo- 
lution of dosage compensation is largely limited 
to the D. melanogaster lineage (3 /—33) and thus 
cannot explain the especially rapid evolution of 
Nup160 and the other Nup107 subcomplex genes 
in D. simulans. For another, Nup160%" kills 
xmelpysin hybrid males and xmel xmelpysim hy- 
brid females, excluding a loss-of-function dis- 
ruption of dosage compensation—a male-specific 
problem—as the cause of hybrid lethality. Instead, 
the recurrent bouts of adaptation of Nup107 sub- 
complex genes are most likely driven by selection 
arising from evolutionary conflict involving path- 
ogens, retrotransposons, or meiotic drive elements 
(30, 34). 

As part of the NUP107 subcomplex, NUP160 
and NUP96 interact directly (25), raising the pos- 
sibility that Nup160"”" and Nup96"” are incom- 
patible with the same X-linked factor(s) from 
D. melanogaster. The D. melanogaster allele 
Nup153”*' is a strong candidate X-linked partner 
for both Nup96"” and Nup160"”": Nup153 has a 
history of adaptive evolution (30), its protein inter- 
acts physically with the NUP107 subcomplex (35), 
and it is the only known interacting nucleoporin 
encoded on the X chromosome. However, it is not 
clear whether Nup160 and Nup96 are involved in 
two distinct two-locus hybrid incompatibilities with 
the D. melanogaster X or whether they are compo- 
nents of one complex hybrid incompatibility (that 
is, one involving more than two loci). Theory 
predicts that complex hybrid incompatibilities should 
evolve more readily than simple hybrid incompat- 
ibilities (36), and there is the suggestion that complex 
hybrid incompatibilities may be typical [for exam- 
ple, (9, 37)]. Regardless of whether nucleoporin- 
based hybrid lethality has a simple or complex 
basis, our study suggests that the adaptive co- 
evolution of a large multi-protein complex may 
have given rise to multiple hybrid-incompatibility 
genes. These findings suggest that divergent co- 
evolution among the interacting partners of macro- 
molecular complexes, particularly those prone to 
evolutionary conflicts, may drive the evolution of 
molecular incompatibilities that contribute to 
speciation. 
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Queen Ants Make Distinctive 
Sounds That Are Mimicked 
by a Butterfly Social Parasite 


Francesca Barbero,”’? Jeremy A Thomas,”2* Simona Bonelli,” 


Emilio Balletto,? Karsten Schénrogge? 


Ants dominate terrestrial ecosystems through living in complex societies whose organization is 
maintained via sophisticated communication systems. The role of acoustics in information 
exchange may be underestimated. We show that Myrmica schencki queens generate distinctive 
sounds that elicit increased benevolent responses from workers, reinforcing their supreme social 
status. Although fiercely defended by workers, ant societies are infiltrated by specialist insects that 
exploit their resources. Sounds produced by pupae and larvae of the parasitic butterfly Maculinea 
rebeli mimic those of queen ants more closely than those of workers, enabling them to achieve 
high status within ant societies. We conclude that acoustical mimicry provides another route for 
infiltration for ~10,000 species of social parasites that cheat ant societies. 


he main attribute that enables ants to 
[evn in most terrestrial ecosystems 

is their ability to live in complex societies 
whose cohesion is regulated by highly devel- 
oped communication systems (/). Information is 
primarily transferred through exchanging dis- 
tinctive semiochemicals, which, combined with 
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physical contact, initiate and integrate different 
behaviors by colony members, providing a mech- 
anism for caste determination and for recogniz- 
ing and behaving appropriately to nestmates, 
non-kin ants, and potential intruders (/—3). In 
addition, the adults in 4 of the 11 subfamilies 
stridulate by scraping a plectrum located on an 
anterior segment of the abdomen (post-petiole) 
across a file (pars stridens) on the first segment 
of their gaster (/, 4). 

About 10,000 other species of invertebrates 
from 11 orders have evolved adaptations to in- 
filtrate ant societies and feed as social parasites 
on the rich resources concentrated inside nests 
(/, 5). Achieving this penetration generally in- 
cludes the corruption through mimicry of their 


host’s communication systems. The lycaenid but- 
terfly Maculinea rebeli is among the better- 
understood examples. After briefly feeding on 
gentians, its final instar larvae are carried by 
Myrmica schencki workers (in Western Europe) 
into their nest, where the butterflies acquire 
~98% of their ultimate biomass before pupating 
11 to 23 months later (5, 6). Inside the brood 
chambers, M. rebeli caterpillars beg like ant lar- 
vae and secrete semiochemicals that so closely 
mimic the surface hydrocarbons on Myrmica 
schencki larvae (7) that they are fed directly 
with regurgitations by the workers (6). How- 
ever, neither begging nor chemical mimicry ex- 
plains the high rank achieved by M. rebeli 
within its host’s social hierarchy. For example, 
inert dummies painted with the surface pher- 
omones of kin ant larvae or workers are re- 
trieved in preference to dummies painted with 
M. rebeli’s mimetic allomones (8), yet living 
M. rebeli larvae are rescued in preference to 
ant larvae when a colony is disturbed (9). Fur- 
thermore, nurse workers kill and feed their 
own brood to the social parasite if food is scarce 
(6, 10). We even observed Myrmica schencki 
queens treat Maculinea rebeli larvae or pupae 
like rivals (//) (fig. S1), whereas the workers reg- 
ularly treat them like royalty (6). Lacking other 
cues, we speculated that the butterfly’s elevated 
status might be achieved through mimicry of 
Myrmica schencki acoustics. Whereas Myrmica 
larvae are mute, distressed Maculinea larvae gen- 
erate sounds that resemble the alarm stridula- 
tions of (adult) Myrmica workers (12). To be 
fully adaptive, the parasite would need to em- 
ulate the typical stridulation patterns of the most 
valued colony members, the queens. This, in 
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turn, presupposes that queen stridulations differ 
from those of workers. 

The substrate-borne (/3) or airborne (/4) calls 
of ants are generally considered to be a weakly 
developed means of communication (/), merely 
signaling alarm and the location of buried in- 
dividuals, enhancing chemical recruitment, or 
(in Pogonomyrmex) indicating an end to mating 
(, 3, 15-17). Two studies, however, suggested 
that different castes might produce distinctive 
signals: The major workers of Atta cephalotes 
make sounds that are more intense and carry 


further than those of their smaller nestmates (/6), 
and the space between the ridges of the pars 
stridens of queens exceeds that of workers in four 
Messor species (18). 

Here, we report that the stridulatory organ 
of Myrmica schencki shows a morphological 
distinction between workers and queens greater 
than those seen in Messor, with the queen pos- 
sessing a 44% longer pars stridens (P = 0.04) 
and a 33% wider gap between the ridges (P < 
0.001) (Fig. 1). We recorded the signals produced 
by unstressed workers and queens (//) [support- 


REPORTS [ 


ing online material (SOM) audio S1 and 82] and 
found that the sounds of MZ schencki queens 
differ significantly from those of workers in their 
dominant frequency (Fig. 1, A and B) (P = 
0.014) and overall acoustics (Fig. 2) (P 
0.035), although the pulse lengths and the pulse 
repetition frequencies (PRFs) of both castes are 
similar (Fig. 1). The first two attributes reflect 
the morphology of the stridulatory organs, 
whereas the PRF reflects the rhythm and speed 
with which the plectrum is scraped across the 
file. 


A Myrmica schencki 
Queen 


Distance (um) 1.93 +0.35 
Length (um) 169.58 + 14.70 


plectrum 


Pulse length (s) 0.021 + 0.001 
PRF (s*') 34, 50 + 2.53 
Frequency (Hz) 812.2 + 40.1 


Fig. 1. Location, morphology, and sounds of the acoustical organs of (A) 
queens and (B) workers of the ant Myrmica schencki and (C) pupae and 
(D) larvae of its social parasite Maculinea rebeli. Measurements of the 
stridulatory file are the average distance between the parallel ribs and of 
length along the central line. Characteristic sound spectra are shown with 
measurements + SE of the average pulse length (PL), PRF, and dominant 
frequency (DF). Linear mixed effect models show that pupal and larval 


B Myrmica schencki 


Distance (um) 1.45 + 0.37 
Length (um) 117.65 + 29.55 


plectrum 


Pulse length (s) 0.021 + 0.002 
PRF (s"') 
Frequency (Hz) 1132.0 + 105.0 


Worker Pupa 


plectrum 


—. 


39.03 + 3.37 PRF (s*) 


(LR = 0.81). 


C Maculinea rebeli 


Pulse length (s) 0.040 + 0.005 
12.84 + 1.02 
Frequency (Hz) 531.0 13.5 


D Maculinea rebeli 
Larva 


acoustical signal 


Pulse length (s) 0.035 + 0.005 
PRF (s*') 13,32 + 1.40 
Frequency (Hz) 404.3 + 39.2 


sounds differ from sounds from queens and workers [likelihood ratio (LR) = 
12.63, P = 0.0004 for PL and LR = 20.67, P = 0.0005 for PRF]; the DF of 
worker sounds differs from those of queens (LR = 10.16, P = 0.0014), pupae 
(LR = 16.65, P = 0.0001), and larvae (LR = 34.37, P = 0.0002); whereas the 
DF of queens is more similar to pupae (LR = 4.12, P = 0.042) than to larvae 
(LR = 13.43, P = 0.0002). Pupal and larval sounds do not differ significantly 
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A ant models: Myrmica schencki worker @ 


Myrmica schencki queen @ 
Maculinea rebeli larva © 
Maculinea rebeli pupa © 


butterfly mimics: 


Pulse length 


Fig. 2. Comparisons of the overall acoustics of Myrmica schencki queens and 
workers and Maculinea rebeli pupae and larvae. (A) Three-dimensional plot of 
the three sound parameters for each category of insect; large orbs indicate 
mean values. (B) The combined effect of the three parameters shown as the 
first and second component plot of a principal components analysis over all 
individual measurements; ellipses indicate 95% confidence intervals; large 
circles, the centroid for each source. Pairwise tests in an analysis of sim- 
ilarities of normalized euclidean distances, with measurements nested within 
individuals, showed that workers differ from queens (R = 0.13, P = 0.035), larvae (R = 0.58, P < 0.001), and pupae (R = 0.64, P = 0.001). Queens also differ 
from larvae (R = 0.62, P < 0.001) and pupae (R = 0.90, P = 0.001). There is no significant difference between larval and pupal sounds (R = 0.02). Mean larval 
and pupal calls are significantly closer (P < 0.0001 and P < 0.001, respectively) to queen sounds than to workers. 


100 


Antagonistic 
behavior 


We played the sounds of both castes to un- 
disturbed laboratory nests of M. schencki work- 
ers, together with controls consisting of a third 


Benevolent social behaviors shown to objects 
of increasing value (—+) to the ant society 


miniature speaker that played computer-generated 454 b nana 
white noise and a fourth that was silent (7/7). No Stembuss 

oe : 44 @ Worker ant 
antagonistic or alarmed behavior was observed, i : 
except for a small repulsion of workers by white 354 =e 
noise (F's 66 = 4.33, P = 0.002) (Fig. 3 and table gut z 


é : Bi Larva butterfly 
S1). In contrast, the ant acoustics resulted in work- 


ers aggregating nonaggressively around the speak- 
er, then tapping it with their antennae (antennation) 
or standing motionless on its surface in a posture 
similar to that adopted when they attend queens 
and objects of high value to their society (on- 
guard attendance) (Fig. 3 and table S1). The sounds 
of workers and queens elicited similar amounts 
of antennation, the main functions of which are 0 
to induce worker-worker recruitment, to smell 
nestmates, and to facilitate oral exchanges of food 
or pheromones (/). However, queen sounds in- 
duced significantly higher occurrences of on-guard 
attendance than did worker calls (posthoc test, 
least significant distance for workers—queens P = 
0.014) (table S1), consistent with the exalted 
status and protection afforded to queens in the 
hierarchy of a colony (/). Combining these re- 
sults, we suggest that acoustical communication 
within the vast subfamily Myrmicinae, to which 


254 


054 


Mean worker ant responses min?°+ s.e. 


Antennate On guard attendance 


Repel Aggregate 


Worker ant behavior 


Fig. 3. Behavioral responses of Myrmica schencki colonies (10 workers) to sound recordings of 
M. schencki workers, M. schencki queens, Maculinea rebeli pupae, and M. rebeli larvae and to two 
controls (white noise and silence). Benevolent behaviors are in order of increasing respect, culminating 
in on-guard attendance. A significant overall difference in responses occurred within all four behaviors 
(repel Fs, 66 = 4.33, P = 0.002; aggregate F; 46 = 4.82, P = 0.001; antennate Fs 6g = 20.20, P < 0.001; 
on-guard Fs, 66 = 8.64, P < 0.001). The letters above each column indicate pairwise posthoc tests of least 
significant differences (P < 0.05): The same letter indicates no significant difference within each type of 
behavior; different letters indicate significantly different responses (table S1). 


Messor and Myrmica schencki belong, is more 
variable and conveys more social information 
within ant colonies than has previously been rec- 
ognized. They also fulfill the strict adaptationist 
definition of biological communication, in which 
both the signal and response are adaptive (/9). 
We next recorded the sounds made by Myrmica 
schencki’s host-specific social parasite, Maculinea 


rebeli, and, unlike an earlier study of Maculinea 
acoustics (/2), we recorded pupae as well as lar- 
vae and did not stress the ants or butterflies (//). 
In contrast to the ants, pupal calls were generated 
by tooth-and-comb stridulatory organs (Fig. 1C 
and fig. S2) situated in the grooves between ab- 
dominal segments 5 and 8. We are unsure of the 
source of larval sounds, which may emanate 


from muscular contractions in the abdomen (20) 
or from sclerotized structures between segments 
4 and 7 (Fig. 1D and fig. S2) that resemble the 
stridulatory organ of the mutualistic lycaenid 
Arhopala madytus (21). Despite the dissimilar 
sources, pupal and larval sounds resembled 
those made by the queens and workers of their 
host (Fig. 2A), but the similarity was 23% and 
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27% closer, respectively, to queen sounds than 
to those of workers (Fig. 2B) [mean normalized 
euclidean distances between individual butter- 
flies’ and ants’ sounds are as follows: pupa-queen 
2.47 + (SE) 0.10, pupa-worker 3.03 + 0.15, t = 
3.14, df 87, distance upaqueens < distances ypa-workerss 
P < 0.001; larva-queen 2.52 + 0.11, larva-worker 
3.21 + 0.12, t= 4.32, df 237, distancejaya-queens < 
distancejava-workerss  < 0.001]. The distributions 
in Fig. 2 also satisfy the concept that the perfect 
mimic should have maximal overlap with queen 
acoustics and minimal overlap with those of 
workers. 

Playing recordings of Maculinea pupal calls 
to the same naive cultures of Myrmica schencki 
workers resulted in enhanced benevolent responses 
similar to those elicited by queen ant sounds. We 
found no significant differences toward Maculinea 
pupal and Myrmica queen calls in any of the four 
behaviors scored, and pupal calls elicited six times 
more instances of royal on-guard attendance than 
occurred when worker sounds were played (Fig. 3 
and table S1) (P < 0.001). Recordings of M. rebeli 
larvae induced lower worker responses and, de- 
spite eliciting 2.3 times more on-guard attendances 
than worker calls, did not differ significantly from 
responses toward worker sounds (Fig. 3 and table 
S1). We did not play Maculinea calls to queen 
ants but predict that they would provoke rivalry 
similar to that observed when live Maculinea 
pupae were artificially enclosed with Myrmica 
schencki queens (11) (fig. $1). 

We suggest that regional host specificity in 
Maculinea populations is mediated first through 
chemical mimicry (6, 22); but once the intruder 
is admitted and accepted as a member of a host 
society, it mimics adult ant acoustics (particu- 
larly queens) to advance its seniority toward the 


highest attainable position in the colony’s hier- 
archy. Selection for accurate acoustical mimicry 
may have been stronger in pupae, which lack the 
main secretory organs of M. rebeli larvae and of- 
fer only weak rewards to tending workers. 

The young stages of other Maculinea species 
make similar pulsed sounds to M. rebeli (12): 
All differ substantially from those of other studied 
Lycaenidae, most of which are commensals or 
mutualists or have no known relationship with ants 
(12, 23-27). None of the latter mimics the acous- 
tics of associated ants in obvious ways, although 
the sound of one strongly mutualistic species attracts 
workers (23-26). Thus, the use of acoustics to sig- 
nal superior status to ants is unlikely to be a basal 
trait in the Lycaenidae, although we might expect 
it in Phengaris, the sister genus to Maculinea. 

Beyond the Lycaenidae, ~10,000 species of 
ant social parasites may exist (5), particularly 
among other Lepidoptera, Coleoptera, Diptera, 
and inquiline ants (/, 6). If acoustics plays the 
role that we suggest in reinforcing an ant’s hi- 
erarchical status, it seems likely that this cue has 
evolved in other social parasites to infiltrate and 
exploit their societies. 
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Stability Predicts Genetic Diversity in 
the Brazilian Atlantic Forest Hotspot 


Ana Carolina Carnaval,** Michael J. Hickerson,” Célio F. B. Haddad,” 


Miguel T. Rodrigues,* Craig Moritz” 


Biodiversity hotspots, representing regions with high species endemism and conservation threat, 
have been mapped globally. Yet, biodiversity distribution data from within hotspots are too sparse 
for effective conservation in the face of rapid environmental change. Using frogs as indicators, 
ecological niche models under paleoclimates, and simultaneous Bayesian analyses of multispecies 
molecular data, we compare alternative hypotheses of assemblage-scale response to late 
Quaternary climate change. This reveals a hotspot within the Brazilian Atlantic forest hotspot. 
We show that the southern Atlantic forest was climatically unstable relative to the central region, 
which served as a large climatic refugium for neotropical species in the late Pleistocene. This sets 
new priorities for conservation in Brazil and establishes a validated approach to biodiversity 
prediction in other understudied, species-rich regions. 


to shape present-day diversity in temper- 
ate and boreal systems (/), providing a 
general context for understanding current pat- 
terns of endemism. In the tropics, Pleistocene 


L: Quaternary climate fluctuations helped 


refugia models have been dismissed because of 
conflicting evidence (2, 3) or circularity in iden- 
tifying putative refugia (4), but historical pro- 
cesses must be invoked to explain regions of 
high endemism (5, 6). Recent studies from sub- 


tropical biomes have usefully employed post hoc 
palaeoclimate models of species and habitats to 
provide insights about processes shaping genetic 
and species diversity (5, 7). Building on them, we 
first map the palaeodistribution of endemic 
species to identify temporally stable (refugial) 
and unstable (recently colonized) regions for 
species occurrence, which are then validated with 
multispecies molecular data. Going beyond the 
traditional species-by-species approach, the mo- 
lecular analyses contrast the fit of assemblage- 
level data to the spatially explicit demographic 
scenarios suggested by the climate-based models. 

We apply this approach to one of the world’s 
most species-rich, yet notoriously endangered and 
understudied ecosystems: the Brazilian Atlantic 
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rainforest. Originally extending for 1,300,000 km? 
along the Brazilian coast and reaching into 
Paraguay and Argentina, this biome has been 
reduced to less than 8% of its range (8). Today’s 
fragments harbor one of the largest percentages 
of endemic species in the world, with many 
species and even genera of vertebrates still being 
described (8, 9). Our ultimate goal is to pinpoint 
regions for inventory work and habitat protection 
before we lose a substantial fraction of described 
and undocumented diversity. The approach dif- 
fers from previous methods by directly modeling 
historical processes, as opposed to observed bio- 
diversity patterns (/0), with the aim of informing 
conservation. 

We use molecular genetic data from multiple, 
largely codistributed species to test whether spa- 
tial modeling of species-specific Late Quaternary 
refugia sheds light on historical processes and 
hence improves prediction of genetic endemism 
and diversity in tropical Brazil (//). We focus on 
three common species of tree frogs that are 
widely distributed along the Brazilian Atlantic 
forest: Hypsiboas albomarginatus, H. semilineatus, 
and H. faber. Given their life history traits, 
amphibians are useful indicators of environ- 
mental changes through time (/2). Whereas H. 
albomarginatus and H. semilineatus occur in low 
and mid altitudes and are mostly restricted to the 
evergreen or semideciduous components of the 
Atlantic Forest in eastern Brazil, H. faber has a 
broader altitudinal range and also inhabits mixed 
and deciduous areas, occupying interior and coast- 
al sites in the Atlantic Forest south to Paraguay and 
Argentina (figs. S1 and S2) (3). The comparative 
phylogeographic approach is a powerful test of 
assemblage-scale responses to former environ- 
mental change and thereby provides a means for 
critical assessment of the scenarios produced by 
modeling of species’ distributions under palaeo- 
climates (7). 

The palaeomodeling method intersects pre- 
dicted species’ distributions under current condi- 
tions and climatic extremes of the Late Quaternary 
(6000 years before present, or 6 kybp, and 21 kybp) 
to predict areas of stability (regions in which 
species are predicted to occupy irrespective of 
time period) and unstable areas (7, /4). Because 
the stability maps raise specific hypotheses about 
regional differences in persistence and hence di- 
versity, they lead to phylogeographic predictions 
for both individual species and assemblages (co- 
distributed taxa; Fig. 1). Field sampling is driven 
by the model predictions to cover both predicted 
refugia and unstable (recently colonized) areas, 
particularly emphasizing previously undersam- 
pled areas. If the approach correctly predicts cur- 
rent patterns of biodiversity at the regional scale, 
species should consistently show (i) higher genetic 
diversity within and among populations in refugia 
relative to unstable areas, because of long-term per- 
sistence and population structure; (ii) genetic sig- 
nature of population expansion in unstable areas, 
reflecting multispecies colonization from adjacent 
refugial regions after the Last Glacial Maximum 


(LGM, 21 kybp); (iii) absence of genetic patterns of 
isolation-by-distance in unstable areas, given that 
colonization has been too recent to permit restora- 
tion of equilibrium between migration and genetic 
drift (5); and (iv) strong phylogeographic structure 
between refugia, reflecting assemblage-wide, long- 
term population persistence in isolated areas. 
Distribution models developed under current 
climatic conditions accurately predict distribu- 
tions of each of the target species along the 
Atlantic rainforest domain [area-under-the-curve 
(AUC) values (/6) 0.968, 0.989, and 0.994; 
maximum Kappa (/7) 0.81, 0.925, and 0.94 in 
#. albomarginatus, H. faber, and H. semilineatus, 
respectively (fig. S2)]. Stability maps, depicting 
the intersection of distribution models for each 
taxon under current, 6 kybp, and 21 kybp cli- 
mates, predict for all species a large central re- 
fugium throughout the Late Quaternary (“Bahia 
refugium”) (Fig. 2). A second, much smaller 
refugium is predicted in the northeasternmost 
portion of the forest (“Pernambuco refugium”). 
In H. faber, a third, southeastern refugium of 
intermediate size is also predicted (“Sao Paulo 
refugium”). This is not surprising, given that this 
species occupies a broader environmental niche. 
In contrast to the central and northern regions, 
populations south of the Bahia or Sao Paulo 
refugia appear much less stable, despite the more 
extensive (preclearing) range of the forest in 
southern and southeastern Brazil. We hypothe- 
size that these areas received a significant influx 
of migrants from adjacent, large refugial pop- 
ulations after the LGM. These palaeomodel re- 
sults are congruent with the fossil pollen record, 
which documents a replacement of forests by 
grasslands in the southern Atlantic forest during 
the LGM (/4, /8) and suggests the occurrence of 
small forest refugia in the southernmost range of 


the putative Bahia refugium (/9). The results also 
agree generally with forest models published 
previously (/4), although the central refugium 
extends farther south in the frog-based models. 
Such differences are expected because the forest 
and its associated species may differ slightly in 
their climatic tolerances and realized niches. In 
H. albomarginatus and H. faber, the extension of 
the predicted Sao Paulo refugium westward into 
the neighboring Cerrado biome reflects model 
overprediction (fig. S2) (14). 

Models of habitat stability through fluctuating 
climates correctly predict patterns of phylogeog- 
raphy in the Brazilian Atlantic rainforest (Fig. 2 
and figs. S3 to S5). In all species, high levels of 
divergence and population structure are observed 
across refugia (Tamura-Nei corrected distances 
(20): 4 to 7% between Bahia and Pernambuco 
refugia, 1% between the nearby Bahia and Sao 
Paulo refugia in H. faber). Similarly, in all taxa 
there are multiple, divergent clades within the 
Bahia region, agreeing with model-based pre- 
dictions of a large refugium in this area. In H. 


faber, divergent clades are also represented in the 


Sao Paulo region, matching predictions of a mid- 
sized refugium in this area. All taxa show low 
genetic diversity across the southernmost range of 
the forest, an area predicted to be less stable by the 
palaeomodels. Furthermore, mitochondrial DNA 
(mtDNA) lineages found in this region are shared 
with adjacent refugia (one in H. albomarginatus 
and H. semilineatus, two in H. faber). 

Metrics of genetic diversity confirm the above 
patterns (Table 1). In H. albomarginatus and H. 
semilineatus, genetic diversity (2/) is an order of 
magnitude larger in the central (Bahia) refugium 
relative to the less stable (southern) portion of the 
forest. Diversity of H. faber in this southern area 
is higher than the other species because of the 


Species 
occurrence 
data 


Map of predicted 
stable areas 
(putative refugia) 


Current & 
historical climate 
data 


distribution of 
genetic diversity 


Model 
Validation 


Descriptive 
phylogeography 


Spatially explicit hypotheses Re: 


Genetic tests of 
stability/expansion, 
divergence times 


spatial-temporal patterns 
of colonization 
and/or vicariance 


Sampling across predicted stable and unstable areas 


congruence 
across taxa 


Assemblage-scale 
hypothesis testing 
(HABC) 


Fig. 1. Proposed method of biodiversity prediction. Three stages are involved: biodiversity dis- 
tribution modeling (top), model-based hypothesis formulation (middle), hypothesis testing and 


model validation (bottom). 
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presence of two lineages that co-occur in the ad- 
jacent refugia. In all species, average net nucle- 
otide differences across localities (22) reflects 
high geographic structure within refugia (2.6 to 


6.2% divergence). In contrast, sites located 
outside (south of) the refugia are genetically 
more similar to each other, although to a lesser 
extent in H. faber (0.1 to 1.6%). Signatures of 


population expansion (23) are found in the 
unstable area for H. albomarginatus and H. faber, 
as well as in the Bahia refugium area for H. faber 
and H. semilineatus. The lack of signature of 


Fig. 2. Genetic diversity in putative A B Cc 

refugial (stable) versus unstable areas 

in the Brazilian Atlantic rainforest. Oo LA 
(Top) Species-specific stability maps; ; 

modeled refugia in black. (A) H. Pernambuco 
albomarginatus, (B) H. semilineatus, refugium 
(C) H. faber. Note the absence of large 
stable regions in the southern portion 
of the forest (south of the Bahia and 
Sao Paulo refugia) relative to the 


Bahia refugium 
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Table 1. Population genetic summary metrics used in model validation.n, 9, and average D, values of the former were obtained not only from the total Q 


Sample size; S, number of segregating sites. The diversity parameter 9 and mean 
D, across localities are given per base pair (bp). Hs test (23) is used to detect 
population expansion. BA, Bahia; SP, Sao Paulo refugia. Because predicted 
refugia were often larger than predicted unstable (recently colonized) areas, n, S, 


number of samples, but also from all possible combinations of spatially 
contiguous localities distributed within the geographic extension of the unstable 
area. Parentheses encompass minimum and maximum values from subsamples. 
P values in bold highlight statistical significance at 0.05 probability level. 


Species Aras n Ss Mean D, Hs Mantel’s corr. coef. 

(min.; max.) (min.; max.) (min.; max.) (min.; max.) (P value) (P value) 

H. albomarginatus Stable (BA) 36 207 0.076 0.062 —20.546 0.499 
(970 bp) (13; 23) (81; 155) (0.034; 0.072) (0.020; 0.082) (0.141) (0.001) 
Unstable 27 22 0.003 0.001 —11.498 —0.140 

(south of BA) (0.004) (0.580) 

H. semilineatus Stable (BA) 28 71 0.031 0.036 —17.778 0.054 
(718 bp) (6; 13) (14; 58) (0.009; 0.034) (0.007; 0.041) (0.029) (0.460) 

Unstable 15 9 0.003 0.004 0.114 0.436 

(south of BA) (0.357) (0.248) 

H. faber Stable (BA) 28 94 0.018 0.026 —38.111 0.803 
(771 bp) (13; 23) (42; 80) (0.012; 0.022) (0.001; 0.044) (0.003) (0.0003) 

Stable (SP) 15 48 0.023 0.028 —5.981 0.305 

(0.115) (0.221) 

Unstable 18 40 0.015 0.016 —13.255 0.0001 

(south of SP) (0.014) (0.456) 
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population expansion in the southernmost local- 
ities of H. semilineatus may reflect low statistical 
power because of the exceptionally low levels of 
diversity observed in this species. As predicted, 
isolation by distance is not observed in unstable 
regions, but is detected within refugial areas for 
H. albomarginatus and H. faber. 

The hierarchical approximate Bayesian com- 
putation (HABC) method (24) allows us to use 
data from all three species at once to test for 
assemblage-wide responses to Late Quaternary cli- 
mate change. These analyses support both model- 
driven hypotheses of (i) simultaneous, multispecies 
colonization of unstable areas from adjacent re- 
fugial populations since the LGM, as opposed to 
long-term persistence of populations in unstable 
areas, and (ii) assemblage-scale, long-term per- 
sistence of populations in isolated refugial areas, 
as opposed to post-LGM colonization of refugial 
regions. 

To test for assemblage-wide colonization 
of predicted unstable areas, we group mtDNA 
sequences from the southernmost refugial sites 
[population 1 (Fig. 3A)] and from localities in 
unstable areas south of the refugium [population 
2 (Fig. 3A)] to contrast two alternative historical 
models across the three codistributed species, 
while allowing the taxon-specific demographic 
parameters to vary. In H;, the long-term persist- 
ence model, two contemporary populations split 
from an ancestral population prior to the LGM 
(120,000 to 1.2 million years before present, or 
Mypbp, Fig. 3A). In H>, the recent colonization 
model, population 2 is modeled as being colo- 
nized from refugial population 1 subsequent to 


Fig. 3. HABC analyses. Ay: long-term persistence 


(A) Simulated models 
H, (long-term persist- 
ence) and H> (recent 
colonization). In both 
cases, each species was 
modeled as two con- 
temporary populations 
with mutation-drift pa- 
rameters 6, and 02 that 
split from an ancestral 
population at a time t in 
the past. Ancestral pop- 
ulation sizes are repre- 6, 
sented by (6,), and (6,).; 
ybp, years before pres- 
ent. (B to E) Hyperposte- 
rior (bars) and hyperprior 
(dashed) densities of Z> 
(number of species evolved 
under H2) given data from 
three codistributed frog 
species. (B) and (C) Mod- 
els of refugial sites (pop- 
ulation 1) and unstable, 
southern areas (popula- 
tion 2). (D) and (E) Models 
of Pernambuco refugium 
(population 1) and Bahia 


Population 1 


Population 1 


H,: recent colonization 


the LGM (0 to 20 kybp; Fig. 3A). The results 
indicate that all three species colonized the 
southern (unstable) areas after the LGM (Z) = 3, 
the number of species evolved under H3), even 
when allowing for postisolation migration (Fig. 
3, B and C). When Bayes factor is used (25), 
there is strong support for recent colonization in 
all three species (Z = 3) under the no-migration 
model [B(Zy = 3, Z. < 3) = 35.16], and moderate 
support under a postisolation migration model 
[B(Z, = 3, Zo < 3) = 5.70]. 

Using the same framework to test for long- 
term persistence of refugial populations, we com- 
pare mtDNA sequences between the predicted 
Pernambuco refugium [population | (Fig. 3A)] and 
adjacent (northern) populations from the Bahia 
refugium [population 2 (Fig. 3A)] to contrast al- 
ternative historical models H, and Hg. In this case, 
the HABC results infer long-term persistence of 
populations in isolated refugia for all three spe- 
cles (Z = 0, 1.e., Z; = 3), even when allowing for 
postisolation migration (Fig. 3, D and E). Using 
Bayes factor (25), we also detect evidence for 
stability in both areas under the no-migration 
model [B(Z, = 0, Z > 0) = 4.89], as well as under 
a postisolation migration model [B(Z, = 0, Z, > 
0) = 4.84]. 

Relative to nuclear loci, mtDNA data are 
more variable and readily collected and often 
provide key insights into biological response to 
environmental modification (/). Although single- 
locus inference can be imprecise in the face of 
coalescent variance and the possibility of selec- 
tion (26), our method benefits from a multitaxon 
approach, while explicitly accounting for the 


stochasticity of a single-locus coalescent across 
taxa. Combining data sets from several codis- 
tributed groups into a single hierarchical Bayesian 
analysis allowed us to estimate congruence across 
species, while borrowing strength from the full 
comparative phylogeographic sample (24). This 
can translate into higher analytical power and be 
more informative than qualitative comparisons of 
species-specific analyses. By capturing the his- 
torical signal that emerges from larger, combined 
multispecies molecular data sets, HABC will 
offer the possibility of looking at patterns of 
historical community assembly in codistributed 
nonmodel organisms for which barcode-type 
DNA sequence information (e.g., mtDNA data) 
can be feasibly collected. 

Collectively, the results identify the central 
region as a hotspot within the Atlantic rain- 
forest hotspot and a refuge for biodiversity 
during climatic extremes of the Late Pleistocene. 
This is not to say that southern areas entirely 
lacked forested habitats in the late Pleistocene: 
The existence of species and genera endemic to 
the southern forests (27), as well as some palaeo- 
ecological and genetic evidence (28), offer evidence 
to the contrary. Rather, the phylogeographically 
validated palaeomodels presented here show that 
the central region had much higher stability 
relative to the south. Forest lizards (/4, 29) and 
birds (30) also show high diversity in the central 
portion of the biome relative to southern areas, 
and provide evidence for population expansion in 
southern regions. This reassures us that the pro- 
cesses uncovered by the amphibian data may be 
generalized to and help to explain patterns of 
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refugium (population 2). (B) and (D) Postisolation migration not included in model; (C) and (E) postisolation migration included in model. 
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diversity in other, much more distantly related 
groups of Atlantic forest endemics. 

Because collection efforts, molecular studies, 
and conservation priorities have been heavily 
biased toward southern and southeastern Brazil 
(8, 9, 31), we predict that genetic diversity and 
narrow endemism in the central corridor of the 
biome have been substantially underestimated. 
This is serious, given the higher rate of deforesta- 
tion in this region relative to the more extensive 
forests in Sao Paulo and southern Brazil (9, 3/). 
Not only could much unique diversity be lost, but 
ongoing habitat destruction could quickly erase 
the signature of the historical processes that led to 
it, preventing a full understanding of the mecha- 
nisms underlying local endemism and, therefore, 
impeding more effective conservation measures. 

At a broader level, the congruence between 
model-based demographic hypotheses and joint, 
multispecies analyses of mtDNA diversity shows 
that palaeoclimatic niche models and assemblage- 
scale molecular genetic analyses can be used to 
forecast spatial patterns of diversity in poorly 
explored, highly threatened ecosystems. In a 
world of ever-accelerating environmental changes, 
this approach can help to guide research and 
conservation in other global hotspots or similarly 
complex tropical ecosystems. 


References and Notes 

1. G. Hewitt, Nature 405, 907 (2000). 

2. F. E. Mayle, D. J. Beerling, W. D. Gosling, M. B. Bush, 
Philos. Trans. R. Soc. London B Biol. Sci. 359, 499 (2004). 


3. E. P. Lessa, J. A. Cook, J. L. Patton, Proc. Natl. Acad. 
Sci. U.S.A. 100, 10331 (2003). 

4. B. W. Nelson, C. A. C. Ferreira, M. F. da Silva, 
M. L. Kawasaki, Nature 345, 714 (1990). 

5. C.H. Graham, C. Moritz, S. E. Williams, Proc. Natl. Acad. 
Sci. U.S.A. 103, 632 (2006). 

6. W. Jetz, C. Rahbek, R. K. Colwell, Ecol. Lett. 7, 1180 (2004). 

7. A. Hugall, C. Moritz, A. Moussalli, J. Stanisic, Proc. Natl. 
Acad. Sci. U.S.A. 99, 6112 (2002). 

8. L. P. C. Morellato, C. F. B. Haddad, Biotropica 32, 786 
(2000). 

9. M. T. Rodrigues, Conserv. Biol. 19, 659 (2005). 

10. C. Kremen et al., Science 320, 222 (2008). 

11. Materials and methods are available as supporting 
material on Science Online. 

12. M. B. Aratijo et al., Ecography 31, 8 (2008). 

13. D. R. Frost, Amphibian Species of the World: An 
Online Reference, version 5.2, 15 July 2008 (American 
Museum of Natural History, New York, 2008); electronic 
database accessible at http://research.amnh.org/ 
herpetology/amphibia/index.php. 

14. A.C. Carnaval, C. Moritz, J. Biogeogr. 35, 1187 (2008). 

15. M. Slatkin, Evolution 47, 264 (1993). 

16. J. A. Hanley, B. J. McNeil, Radiology 143, 29 (1982). 

17. J. Cohen, Ed. Psych. Meas. 20, 37 (1960). 

18. H. Behling, R. R. B. Negrelle, Quat. Res. 56, 383 (2001). 

19. H. Behling, H. W. Arz, J. Patzold, G. Wefer, Palaeogeogr. 

Palaeoclimatol. Palaeoecol. 179, 227 (2002). 

20. K. Tamura, M. Nei, Mol. Biol. Evol. 9, 678 (1993). 

21. F. Tajima, Genetics 143, 1457 (1996). 

22. M. Nei, Molecular Evolutionary Genetics (Columbia Univ. 

Press, New York, 1987). 

23. J. C. Fay, C. |. Wu, Genetics 155, 1405 (2000). 

24. M.]. Hickerson, C. P. Meyer, BMC Evol. Biol. 8, 322 (2008). 

25. R. E. Kass, A. E. Raftery, J. Am. Stat. Assoc. 90, 773 

(1995). 

26. J. W. O. Ballard, M. Kreitman, Genetics 138, 757 (1994). 
27. C. F. B. Haddad, L. F. Toledo, C. P. A. Prado, Anfibios da 
Mata Atlantica (Atlantic Forest Amphibians) (Editora 

Neotropica, Sao Paulo, Brazil, 2008). 


REPORTS 


28. M.-P. Ledru et al., Divers. Distrib. 13, 761 (2007). 

29. K. C. M. Pellegrino et al., Biol. J. Linn. Soc. London 85, 
13 (2005). 

30. G. S. Cabanne, F. M. d’Horta, E. H. R. Sari, F. R. Santos, 
C. Y. Miyaki, Mol. Phylogenet. Evol. 49, 760 (2008). 

31. J. M. C. da Silva, M. Tabarelli, Nature 404, 72 
(2000). 

32. We thank U. Caramaschi and H. Zaher for providing 
access to collections MNR] and MZUSP; O. Peixoto, 
M. Gomes, A. Muri, R. Kautskyi, S. Lima, E. Santos, 
J. S. Filho, J. V. Filho, G. Barros, J. Queiroz, R. Araujo, 
L. Japp, H. Japp, J. Giovanelli, J. Alexandrino, L. Toledo, 
O. Araujo, G. Egito, J. Zina, D. Loebmann, D. Pavan, 
R. Amaro, V. Verdade, F. Curcio, M. Dixo, and J. Cassimiro 
for field work assistance; W. Monahan and R. Hijmans for 
discussions about the modeling work; L. Smith and 
D. Turong for DNA-sequencing assistance; R. Pereira, 
R. Damasceno, S. Rovito, J. Kolbe, S. Singhal, 
R. Puschendorf, and A. Pounds for discussions about 
earlier versions of the manuscript. Funding was provided 
by the NSF (awards DBI 0512013 to A.C.C., DEB 
0743648 to M.J.H., DEB 416250 and DEB 0817035 to 
C.M.), Fundagao de Amparo a Pesquisa do Estado de Sao 
Paulo and Conselho Nacional de Desenvolvimento 
Cientifico e Tecnoldgico (grants to C.F.B.H. and M.T.R.). 
Sequences are deposited in GenBank (F]502639- 
F]502822). A.C.C. and C.M. designed the study; A.C.C., 
C.F.B.H., and M.T.R. collected the data; A.C.C., C.M. and 
M.J.H. analyzed the data; A.C.C. wrote the paper. All 
authors discussed the results and commented on earlier 
versions of the manuscript. 


Supporting Online Material 
www.sciencemag.org/cgi/content/full/323/5915/785/DC1 
Materials and Methods 

Figs. $1 to S6 

Tables $1 and $2 

References 


8 October 2008; accepted 9 December 2008 
10.1126/science.1166955 


Orc1 Controls Centriole and 
Centrosome Copy Number in 


Human Cells 


Adriana S. Hemerly,”” Supriya G. Prasanth,’* Khalid Siddiqui,’+ Bruce Stillman7+ 


Centrosomes, each containing a pair of centrioles, organize microtubules in animal cells, 
particularly during mitosis. DNA and centrosomes are normally duplicated once before cell division 
to maintain optimal genome integrity. We report a new role for the Orc1 protein, a subunit of 
the origin recognition complex (ORC) that is a key component of the DNA replication licensing 
machinery, in controlling centriole and centrosome copy number in human cells, independent of 
its role in DNA replication. Cyclin A promotes Orc1 localization to centrosomes where Orcl 
prevents Cyclin E-dependent reduplication of both centrioles and centrosomes in a single cell 
division cycle. The data suggest that Orc1 is a regulator of centriole and centrosome reduplication 


as well as the initiation of DNA replication. 


he assembly of a bipolar, microtubule spin- 

dle during mitosis is essential for accurate 

chromosome segregation. In animal cells, 
spindle formation is organized by centrosomes, 
organelles that contain a pair of centrioles sur- 
rounded by pericentriolar material (PCM) that 
need to be duplicated exactly once every cell 
division cycle, in coordination with DNA repli- 
cation to maintain genome stability (7). Licens- 
ing DNA for replication is a critical regulatory 


step involving the origin recognition complex 
(ORC), the first component for assembly of a 
pre—replicative complex (pre-RC) at each origin 
(2). Accumulated evidence supports roles for ORC 
subunits in addition to licensing DNA replication 
(3). In particular, human Orc2 subunit localizes to 
centrosomes, and depletion of Orc2 and Orc3 
causes centrosome amplification in mitosis (4). 
Several regulators of the DNA licensing 
machinery have been reported to be involved 


in the control of both DNA and centriole du- 
plication (5). Both Cyclin E and Cyclin A, as 
well as Cdk2 activity, are well-known positive 
regulators of DNA replication and also promote 
centrosome duplication (or reduplication) (6-//). 
Depletion of the DNA replication licensing in- 
hibitor Geminin causes reduplication of both 
DNA and centrosomes in human cells (/2). 
In a screen using small interfering RNA 
(siRNA) for human ORC proteins with roles in 
centrosome biology, we found that depletion of 
the largest human ORC subunit, HsOrc1, leads to 
excess centrosomes (fig. S1 and Fig. 1A). Orcl 
siRNA-treated U2OS cells were analyzed for cen- 
trosome defects by dual-color indirect immuno- 
fluorescence (IF) using antibodies to centrin 2 
(stains centrioles) and antibodies to y-tubulin 
(stains centrosomes). Seventy-two hours after 
siRNA treatment, 39.77 + 3.5% of cells trans- 
fected with Orc1-1 siRNA and 25.53 + 0.3% of 
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cells transfected with Orcl-2 siRNA showed 
multiple centrosomes and centrioles, in com- 
parison with 2.4 + 1.2% of control cells (fig. 
S1C). In Orcl-1 treated cells, 33.29 + 2.6% of 
the multiple centrosomes were separated rather 
than linked, and the reduplication occurred in 
different human cell lines (figs. S1 and S2). 
To study the kinetics of centriole and cen- 
trosome reduplication after Orc! depletion, U2OS 
cells were synchronized in mitosis by nocodazole 
treatment and released into the next cycle. Cen- 
triole and centrosome numbers were assessed 12 
hours, 36 hours, and 60 hours after nocodazole 
release (Fig. 1B and fig. S3). Disorganized dis- 
engagement of duplicated centrioles (2Pairs Dis.) 
was observed 12 hours after release when most of 
the cells were in G1 or early S phase (Fig. 1B and 
fig S3, centrin 2). This was followed by an in- 
crease in the percentage of cells showing centriole 
reduplication (>2Pairs). The complete centrosome 
structure, as evidenced by y-tubulin staining, was 
observed later, about 60 hours after release (y- 
tubulin). The very early effect of Orc1 depletion 
on centriole disengagement and reduplication 
suggests that Orcl might have a more direct 
effect on these processes, whereas complete cen- 
trosome formation follows centriole reduplica- 
tion. Similar dynamics of centriole reduplication 
after Orc! depletion were observed in HeLa cells 
(fig. S4). It has been proposed that centriole 
engagement blocks reduplication (3). In Orcl- 
depleted cells, however, the duplicated centrioles 
were found disengaged and underwent redupli- 
cation, suggesting that Orel may block redupli- 
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Fig. 1. Orc1 depletion causes centriole and centrosome reduplication. (A) 
Human U20S cells treated for 72 hours with Orcl-1 siRNA duplex were 
coimmunostained for centrioles with antibodies to centrin 2 (green, a to e) and 
centrosomes with antibodies to y-tubulin (red, a’ to e’). Insets are higher 
magnification. DNA was stained with 4°,6 -diamidino-2-phenylindole (DAPI) 
(blue), and merged images are shown in a” to e”. Scale bar, 10 um. (B) 
Quantification of centrosome and centriole numbers by y-tubulin (top) and 


cation by participating in processes of centriole 
engagement of the newly duplicated centrioles. 

Different mechanisms for centrosome am- 
plification have already been reported. Apparent 
multiple centrosomes can arise by centrosome 
fragmentation, centriole splitting that occurs 
during early mitosis after DNA damage (/4), or 
authentic reduplication. Analysis of multiple 
parameters—including DNA content (fig. S1B), 
cell cycle markers (fig. S5), centriolin staining of 
mature centrosomes (figs. S6 and S7), and the 
numbers of centrioles per PCM (Fig. 1A)—all 
indicate that Orc1 depletion causes bona fide cen- 
triole and centrosome reduplication in U2OS cells 
(see supporting material). 

To address whether overexpression of Orcl 
would affect centrosome reduplication caused by 
prolonged S-phase arrest in the presence of hy- 
droxyurea (HU) (9, 75), U2OS cells were tran- 
siently transfected with YFP-Orel V7, YFP-Orc2, 
or a vector carrying only yellow fluorescent 
protein (YFP) in the presence of HU (Fig. 2, A 
and D, and fig. S8). Centrosome reduplication 
after HU treatment, but not normal duplication, 
was substantially inhibited in YFP-Orc1" cells, 
whereas YFP or YFP-Orc2 did not have any 
effect, suggesting a role for Orc! as an inhibitor 
of centrosome reduplication. 

YFP-Orcl associated with centrosomes in 
addition to its nuclear localization in ~35% of 
transfected cells (fig. S9A and movie S1). Further- 
more, endogenous Orc! and other ORC subunits 
cofractionated with purified centrosomes (fig. 
S9B). Transiently expressed Orcl fused to the 
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pericentrin-AKAP450 centrosomal targeting 
(PACT) domain, a conserved motif that targets 
proteins to centrosomes (/6), resulted in inhibi- 
tion of centrosome reduplication in HU-treated 
U20S cells (Fig. 2, B and D), suggesting that 
Orcl controls centrosome copy number when lo- 
cated at centrosomes. 

Centrosome reduplication was inhibited by 
the full-length and N-terminal region of Orcl 
(amino acids 1 to 500; Orel" Nterm.Flag) (Fig. 
2C), the latter of which is deficient in ORC 
assembly (/7). Both Orc] versions also inhibited 
centrosome reduplication when expressed after 
the application of the HU block, indicating that 
the inhibition was not caused by a G1 arrest in 
these cells (fig. S10). Thus, Orcl control of cen- 
trosome copy number is separate from its DNA 
replication role and does not require assembly of 
ORC, but such assembly may facilitate a more 
efficient block. 

Orcl depletion in synchronized U2OS cells 
did not trigger early responses of DNA damage 
checkpoint pathways that are known to affect 
centriole and centrosome copy number (/8, /9). 
Activation of Chk1 by phosphorylation on serine 
317 was not detected, and p27 and p53 pathways 
were activated well after centriole amplification 
was initiated (Fig. 1B and fig. S11). 

Several reports have shown that Cyclin E 
overexpression in mammalian cells is linked to 
genome and centrosome abnormalities (20, 2/) 
and that centrosome reduplication requires Cdk2 
activity (7-9, 11). Elevated levels of Cyclin E 
were identified as early as 12 hours after release 
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centrin 2 (bottom) immunostaining. Cells were harvested at 12, 36, or 60 
hours after nocodazole release. Error bars, 1 SE. 1C, one centrosome; 2C(L), 
two centrosomes linked; 2C(S), two centrosomes separated; >=3C(L), three or 
more centrosomes linked; >=3C(S), three or more centrosomes separated. 
1Pair, one centriole pair; 2Pairs (L), two centriole pairs linked; 2Pairs (S), two 
centriole pairs separated; 2Pairs (Dis.), two centriole pairs disorganized and 
disengaged; >=2Pairs, more than two centriole pairs. 
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of synchronized Orcl-depleted U2OS cells in 
Cyclin A—negative cells (Fig. 3, A and B, and fig. 
S5, A and B) and continued to increase. We thus 
hypothesized that Orc! control of centriole and 
centrosome reduplication involved Cyclin E and 
Cdk2. The Cdk2 inhibitor roscovitine suppressed 
centriole and centrosome reduplication in Orcl- 
depleted cells (fig. S12). In a second experiment, 
simultaneous depletion of Orel and Cyclin E 
blocked centriole and centrosome reduplication, 
whereas normal duplicated centriole and centro- 
some numbers were observed (Fig. 3C and fig. 
S13A). Cyclin E siRNA did not change the 
number of cells with a 4C DNA content after 
Orc1-1 treatment (fig. $13B), which implies that 
the block of centrosome overduplication in Orc1- 
1—-depleted cells by Cyclin E siRNA was not due 
to a compete G1 arrest. In contrast, Cyclin A 
siRNA did not abolish centrosome reduplication 
in Orcl-depleted cells (Fig. 3C). 

We tested the ability of elevated levels of 
cyclins to cause centrosome reduplication in HU- 
treated cells expressing YFP.Ore1™’ (Fig. 4A). 
Coexpression of Cyclin E with YFP.Orel“? 
blocked YFP.Orc1™" inhibition of HU-induced 
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centrosome reduplication, whereas neither Cyclin 
A nor Cyclin B had any effect (Fig. 4, A and B). 
HU-treated cells coexpressing YFP.Orel“" with 
Cyclin A showed a cytoplasmic pool of over- 
expressed YFP.Orc1™" in addition to the nuclear 
localization and an enhancement of YFP-Orcl 
located at centrosomes (Fig. 4B, b to b”) (The 
number of YFP-Orcl—positive centrosomes in 
the presence of Cyclin A increased six-fold com- 
pared with YFP-Orcl alone). Coexpression of 
Cyclin E with YFP.Orcl“'PACT also elimi- 
nated the ability of YFP.Orcl "PACT to block 
centrosome reduplication, whereas Cyclin A or 
Cyclin B had no effect (Fig. 4, A and B). 
Altogether, the results suggest that Orcl might 
directly regulate centriole and centrosome copy 
number in human cells and that the mechanism 
involves Cyclin E. 

Orcl coprecipitated with Cyclin E and Cyclin 
A, as previously shown (22), whereas no asso- 
ciation was observed with Cyclin B (Fig. 4C). 
Association in a complex with Cyclin E was 
stronger than with Cyclin A; however, so far we 
have not been able to find evidence that Orcl 
binds Cyclin E directly. Nevertheless, purified 
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wild-type Orcl inhibited Cyclin E-Cdk2 and 
Cyclin A-Cdk2 kinase activity using histone H1 
as substrate (fig. S14B). 

The Orcl “RXL” mutant (K235A-L237A, 
Orel“) impairs the direct interaction with 
Cyclin A but does not affect Cyclin E binding 
(Fig. 4C and fig. S14A) and cannot block Cyclin 
A-associated kinase activity (fig. S14B). Al- 
though some inhibition was observed, Orel“ 
and Orel“““Nterm did not block centrosome 
reduplication as efficiently as did wild-type Orc1, 
which suggests that Orc1-Cyclin A interaction is 
required for optimal centrosome copy number 
control (Fig. 4D). Orcl““.PACT blocked cen- 
trosome reduplication to the same extent as 
Orel“ "PACT (Fig. 4E). Because Cyclin A 
function has been previously linked to the cyto- 
plasmic localization of Orcl (23), possibly 
Orel** and Orcl“~“.Nterm mutants were not 
exported efficiently to centrosomes to block HU- 
induced reduplication, whereas Orcl1“““. PACT 
was directly targeted to centrosomes. Accord- 
ingly, Cyclin A stimulation of Orc! association 
with centrosomes was significantly reduced when 
coexpressed with the RXL Orcl*“ mutant. 
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Fig. 2. Overexpression of Orc1 blocks centrosome reduplication. U20S 
asynchronous cells were transfected with the indicated constructs and 
treated with 16 mM hydroxyurea (+HU). Untransfected cells were treated 
or not with HU (UNT +HU and UNT -HU, respectively). Cells were har- 
vested at 68 hours after HU treatment. Centrosome numbers were scored 
by y-tubulin immunostaining in YFP- or Flag-positive cells. (A to C) Quan- 
tification of multiple centrosomes in HU-arrested cells. Error bars, 1 SE. 


(D) Cells were immunostained for centrosomes with antibody to y-tubulin 
(red, a’ to i’); YFP expression was immunostained with antibody to 
green fluorescent protein (GFP) in (A) (green, a to d) or visualized di- 
rectly in (B) (e and f); and Flag expression was immunostained with 
antibody to Flag in (C) (g to i). DNA was stained with DAPI (blue), and 
merged images are shown in a” to i’. Scale bar, 10 um. Insets are higher 
magnification. 
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Altogether, the data indicate that both Cyclin E 
and Cyclin A play a role in Orc! mediated control 
of centrosome numbers. 

ORC in human cells is dynamically assem- 
bled and disassembled during the cell cycle and 
participates in many aspects of chromosome du- 


plication and segregation, including functions at 
kinetochores, centromeres, and heterochromatin 
(3). Orc2 is localized to centrosomes, but its de- 
pletion affects centrosome copy number in mi- 
tosis in addition to controlling DNA replication, 
centromere activity, and chromosome structure 


(4). In contrast, Orc] controls Cyclin E-dependent 
centrosome copy number by directly preventing 
reduplication of centrioles immediately after cen- 
trioles duplicate upon commitment to cell division. 
Without Orcl, duplicated centrioles are disen- 
gaged and extensive Cyclin E-dependent centriole 
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Fig. 3. Orcl depletion causes Cdk2 and Cyclin E—dependent centriole and centrosome 
reduplication. (A and B) Nocodazole-arrested U20S cells were transfected with control (GL3) 
or Orci1-1 siRNA, then released into the next cycle and retransfected with the same siRNA 
duplexes. (A) Immunoblot of whole-cell extract of cells harvested at the indicated time points. 
Orci, Cyclin E, Cyclin A, and a-tubulin levels were assessed by immunoblotting with specific 
antibodies. CycE*, a longer exposure. (B) Cells were harvested at 12 hours after nocodazole 
release and coimmunostained with antibody to Cyclin E (green) and antibody to Cyclin A (red). 
DNA was stained with DAPI (blue), and merged images are shown in a” and b”. Scale bar, 30 GL3 
um. (C) Quantification of centrosome and centriole numbers by y-tubulin (top) and centrin 2 

(bottom) immunostaining, respectively, of asynchronous U20S cells transfected with the indicated siRNAs. Cells were analyzed 72 hours after the first transfection. 
Error bars, 1 SE. 
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reduplication occurs. In the absence of Orc! and 
Cyclin E, centrosome reduplication does not occur; 
however, centrioles duplicate, which suggests that 
such duplication is Cyclin A—dependent, consist- 
ent with previous studies (24). Similarly, Cyclin 
E, but not Cyclin A, overrides the Orc! inhibition 
of centrosome reduplication. 

Cyclin E is required for centrosome dupli- 
cation and reduplication and coprecipitates with 
Orel (7, 11, 25-27). We have not been able to 
detect direct binding between Cyclin E and Orcl, 
which suggests that a mediator protein may fa- 
cilitate the binding. The mediator may be another 
DNA replication protein implicated in centro- 
some biology (/2, 28). Furthermore, depletion of 
Orcl increases Cyclin E protein levels in cells 
that may contribute to centriole reduplication. 
Both proteins are degraded by SKP1/cullin/F-box 
protein (SCF), which localizes to centrosomes and 
has also been implicated in the control of centro- 
some duplication (29, 30). 

There is a short time during the cell division 
cycle after centrioles have duplicated when Cyclin 
E is still present in cells (late G1 and early S 
phase). During this time, if Cyclin E activity is not 
checked, centriole reduplication might occur. We 
suggest that a mechanism involving Cyclin A— 
dependent localization of Orel to centrosomes 


prevents Cyclin E-dependent centriole reduplica- 
tion during this time window (fig. S15). In this 
manner, Orcl plays a dual role in coordinating 
DNA replication and centrosome copy number. 
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Cytokines such as interleukin-6 induce tyrosine and serine phosphorylation of Stat3 that results in 
activation of Stat3-responsive genes. We provide evidence that Stat3 is present in the mitochondria 
of cultured cells and primary tissues, including the liver and heart. In Stat3~~ cells, the activities 
of complexes | and II of the electron transport chain (ETC) were significantly decreased. We identified 
Stat3 mutants that selectively restored the protein’s function as a transcription factor or its 
functions within the ETC. In mice that do not express Stat3 in the heart, there were also selective 
defects in the activities of complexes | and II of the ETC. These data indicate that Stat3 is required for 
optimal function of the ETC, which may allow it to orchestrate responses to cellular homeostasis. 


inding of cytokines to their cell surface 
Bees activates the Jak protein tyro- 

sine kinases that phosphorylate conserved 
tyrosine residues on Stat proteins (/). Tyrosine- 
phosphorylated Stats translocate to the nucleus 
and bind to the promoters of early response genes. 
Several of the Stats are also phosphorylated on a 
conserved serine residue. 

GRIM-19 was identified as a protein in- 
volved in interferon-B (IFNB)}- and retinoic acid— 
stimulated apoptosis of breast cancer cells (2). 
GRIM-19 inhibits Stat3 transcriptional activity, 
and these proteins directly interact (3, 4). GRIM- 


19 was also purified as a component of complex I 
of the electron transport chain (ETC) (5). GRIM- 
19 mice are embryonically lethal, and in 
GRIM19~ embryonic stem cells, assembly of 
complex I (of which there are 46 known com- 
ponents) is defective. The absence of GRIM-19 
also influences the assembly and function of other 
complexes of the ETC (6). These observations 
indicate that Stat3 and GRIM-19 might colocalize 
in the mitochondria. 

To address this possibility, we used SDS— 
polyacrylamide gel electrophoresis (SDS-PAGE) 
to separate isolated mitochondria and cytoplasm 


from the livers and hearts of mice, and we used 
Western blotting to detect Stat3 (Fig. 1A) (7). 
Stat3 was present in mitochondria and cytoplasmic 
fractions. The amount of Stat3 in the mitochon- 
dria (mStat3) was about one-tenth that in the cy- 
tosol. The blots were also probed for GRIM-19, 
tubulin as a cytoplasmic marker, calreticulin as 
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an endoplasmic reticulum marker, and porin as a 
mitochondria marker. In most experiments, we 
detected little or no tubulin or calreticulin in the 
mitochondrial fraction. Although GRIM-19 is 
reported to be located in the nucleus and cytosol, 
we only observed the protein in mitochondria 
(3, 4). Similar results were obtained from mitochon- 
dria isolated from the mouse brain, kidney, and 
primary splenocytes, as well as several other mouse 
and human cell lines. We have yet to find a tissue 
or cell line that does not contain Stat3 in the 
mitochondrial fraction. 

To confirm that mStat3 did not represent con- 
tamination from the cytoplasm, we immunobloted 
proteins from increasing amounts of purified 
mitochondria for Stat3, tubulin, and cytochrome 
c (Fig. 1B). The ratio of Stat3 to tubulin in the 
cytoplasm was 0.9 in the heart and 1.0 in the 
liver. The ratio of Stat3 to tubulin in the heart 
mitochondria ranged from 13 to 19 and in the 
liver mitochondria ranged from 4 to 9. If the Stat3 
detected in the mitochondria was due to cytosolic 
contamination, then the ratio of Stat3 to tubulin 
should have been ~1. 

To determine whether mStat3 was on the 
outer membrane, we incubated liver mitochon- 
dria with proteinase K in the presence or absence 
of triton X-100. Stat3 and GRIM-19 (an inner 
membrane protein) were resistant to proteinase 
K, whereas Bel-2 and porin, both of which are 
localized on the outer membrane, were degraded 
(Fig. 1C). When proteinase K was added to 
mitochondria in the presence of triton X-100, all 
of the proteins were degraded. These results 
indicate that Stat3 does not adhere to the outer 
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Fig. 1. Stat3 in the mitochondria from mouse heart and liver. (A) © 
Whole cell (WCE), cytoplasmic (Cyto), and mitochondrial (Mito) extracts 
were separated by SDS-PAGE. The blots were probed with antisera 
against Stat3, o-tubulin, calreticulin, and GRIM-19. (B) Increasing 
amounts of heart and liver mitochondria probed for Stat3, tubulin, and 
cytochrome c. (C) mStat3 is proteinase K—resistant. Mitochondria were 
incubated with (lanes 2 and 3) or without (lane 1) proteinase K (PK). To 
disrupt mitochondrial integrity, triton X-100 was added in the digestion 
buffer (lane 3). Samples were probed for Stat3, porin, Bcl-2, and GRIM- 
19. (D) mStat3 is serine phosphorylated. Purified mitochondria, as well 
as cytosolic and whole cell extracts were prepared from WT mice hearts. 
The immunoblots were probed for either total Stat3 or serine phos- 


membrane of the mitochondria in a potentially 
nonspecific manner and is probably located in 
either the intermembrane space, the inner mito- 
chondrial membrane, or the matrix. 

We detected Stat3 in immunoprecipitates of 
mitochondrial extracts prepared from the liver 
with the use of a monoclonal antibody that cap- 
tures all components of complex I of the ETC 
(Fig. 2) (8). We focused on complex I because 
GRIM-19 is a component of complex I, and Stat3 
has been shown to interact with GRIM-19 (3, 4). 
As a control, identical samples were incubated 
with isotype-matched antibody under the same 
conditions. To ensure antibody specificity, the 
blots were also probed for known components of 
complex I. Immunoprecipitates of complex I con- 
tained Stat3, whereas no immunoreactive Stat3 
or any other proteins were present in the im- 
munoprecipitates using the immunoglobulin G 
(IgG)-matched control. As expected, NDUFA9 
[NADH dehydrogenase (ubiquinone) 1 alpha sub- 
complex 9, where NADH is the reduced form of 
nicotinamide adenine dinucleotide] and GRIM-19 
(both components of complex I) were immuno- 
precipitated with the antibody. The complex I 
antibody did not immunoprecipitate complex III 
core protein 2, complex IV subunit I, or the 19-kD 
adenosine triphosphate (ATP) synthase, a com- 
ponent of complex V (Sth, 6th, and 7th panels in 
Fig. 2). The complex II Fp subunit did immuno- 
precipitate with the complex I antibody (4th panel 
in Fig. 2). These results indicate the presence of 
Stat3 in complex I and possibly II of the ETC. 
Alternatively, there could be an interaction of 
complexes I and II of the ETC. 


Liver B 


Western: 


ss | 


phorylated Stat3, as well as voltage-dependent anion channel (VDAC) 
and a-tubulin as controls for mitochondrial purity. A fluorescent conjugated secondary antibody was used to develop the blots allowing the relative amount of 
total and serine phosphorylated Stat3 to be measured. The ratio of total Stat3 to serine phosphorylated Stat3 in whole cell extracts was 2.5, in cytosolic extracts 


was 2.3, and in mitochondria extracts was 0.3. 


To examine whether mitochondrial Stat3 ex- 
pression might modulate oxidative phosphoryl- 
ation, we performed polarographic studies with the 
use of intact mitochondria isolated from Stat3‘”* 
and Stat3“~ pro-B cells. This assay evaluates the 
integrated function of the ETC coupled to ATP 
synthesis, membrane transport, dehydrogenase 
activities, and the structural integrity of the 
mitochondria. Glutamate was used to drive com- 
plex I-dependent electron transport. Succinate is 
the specific substrate that we used as the electron 
donor for complex II. We were not able to ac- 
cumulate a sufficient amount of mitochondria to 
measure complex III activity. We measured com- 
plex IV activity using N,N,N’,N’-tetramethyl-p- 
phenylenediamine (TMPD) with L-ascorbate. 

Maximal state 3 oxidation rates with adeno- 
sine diphosphate (ADP) (2 mM) were reduced by 
70% in Stat3“ cells with glutamate as a substrate 
for complex I and by 50% in cells with succinate 
as a substrates for complex II (Table 1). Oxi- 
dation rates for TMPD/ascorbate (complex IV) 
were not substantially different, localizing defects 
to complexes I, II, and/or III of the ETC. 

Oxidase activities were measured in mitochon- 
dria permeabilized by freezing and thawing (9). 
The oxidase studies bypass potential defects in 
substrate carriers to exclude defects of substrate 
transport as causes of the decreased oxidation 
rates. NADH oxidase activity requires complexes 
I, II, and IV, and duroquinol oxidase (DHQ) re- 
quires complexes III and IV. Consistent with the 
respiration rates obtained with intact mitochon- 
dria, NADH oxidase activity was reduced by 
65% in mitochondrial preparations from Stat3 
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pro-B cells as compared with that in wild-type 
(WT) pro-B cells (Fig. 3A). In contrast to that of 
NADH oxidase, the activities of DHQ (Fig. 3A) 
were similar in Stat} and WT pro-B cells. 
These results reveal a defect at complexes I and 
II of the ETC in mitochondria from Stat3“ pro-B 
cells. To further localize the ETC defects, we 
measured enzyme activities of the electron transport 
chain complexes (ETC assays) in solubilized 
mitochondria. Consistent with the polarographic 
studies, the defects of complexes I were decreased 
by 40% in Stat3~ pro-B cells as compared with 
WT pro-B cells (Fig. 3B). Complex II activity in 
Stat3~ cells was also decreased by ~85%. 
Complex III and IV activities for both WT and 
Stat3 “" pro-B cells were not affected (Fig. 3B). 


Fig. 2. Stat3 in complex | immunoprecipi- 
tates (IP). Antibody to complex | (Cxl) or a 
nonspecific isotype-matched IgG were incu- 
bated with liver mitochondrial extracts. Immu- 
noprecipitates were resolved on SDS-PAGE and 
probed for Stat3, GRIM-19, NDUFA9, Cx Il Fp 
subunit, Cx Ill core protein 2, Cx IV subunit |, 
and Cx V @ subunit. 
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Fig. 3. Depressed mitochondrial respiration in Stat3~ pro-B cells. (A) Mitochondrial 
oxidase activity in WT and Stat3~ cells: NADH oxidase (left) and DHQ oxidase (right). 
Oxygen consumption is expressed as nanogram atom of oxygen per minute per 
milligram of mitochondrial protein (nAtom O/min/mg) and is presented as mean + 
SD. (B) ETC activities of complexes |, Il, III, and IV. Activities are expressed as milliunits 
(nanomoles per minute) per milligram of mitochondrial protein and were normalized 
to citrate synthase activity. Results are presented as mean + SD. Error bars indicate SD; 


asterisks indicate P < 0.05. 


To determine whether the deficiency in oxy- 
gen consumption in Stat3“~ cells was due to a 
decrease in mitochondrial content, we measured 
the activity of the mitochondrial enzyme citrate 
synthase in WT and Stat3~ pro-B cells. This 
enzyme is an exclusive marker of the mitochon- 
drial matrix, and its activity was similar in Stat3“* 
and Stat3”~ cells (fig. SLA). We also measured 
amounts of porin, cytochrome c, and several com- 
ponents of ETC complexes including NDUFA9, 
NDUFS3 [NADH dehydrogenase (ubiquinone) 
Fe-S protein 3], complex II subunit Fp (70 kD), 
complex II subunit I p (30 kD), and COX-I (cy- 
tochrome c oxidase-I) in WT and Stat3~ cells by 
immunoblotting. All of these proteins were present 
in similar amounts in both cell types (fig. S1B). 
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The amount of mitochondrial DNA—encoded 
genes was similar between WT and Stat3~~ pro- 
B cells (fig. S1C), as were the concentrations of 
mitochondrial-encoded RNAs (fig. S1D). These 
results suggest that mitochondrial content is not 
affected in pro-B cells that lack Stat3. 

To determine whether alterations in ETC were 
directly due to a loss of Stat3 expression, we 
reconstituted Stat3a into Stat3”~ B cells with a ret- 
rovirus expressing Stat3a and a downstream green 
fluorescent protein cDNA to allow identification of 


Table 1. Maximal rates of state 3 respiration in 
mitochondria isolated from WT and Stat3~ pro-B 
cells with glutamate, succinate, or TMPD/-ascorbate 
as substrates, Results are expressed as nAtom O/min/mg 
of mitochondrial protein and are presented as mean + 
SD. Significant differences between Stat3~ and WT 
pro-B cells are shown in the table as the corresponding 
P value. N denotes the number of independent 
experiments. 
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infected cells (/0). Expression of Stat3a restored 
complex I and II activities (see Table 2, legend). 
Immunoblots of isolated mitochondria and cytosol 
from the reconstituted cells confirmed that Stat3a 
was expressed in both cellular fractions (fig. S2A). 

To determine whether Stat3, expressed only 
in mitochondria, restored respiration in Stat3”~ 
cells, we constructed a Stat3 that contains the 
mitochondrial targeting sequence of cytochrome 
c oxidase subunit VIII (MLS) fused to the N 
terminus of the protein (MLS Stat3). This se- 
quence places the protein of interest in the inner 
mitochondrial membrane. Fractionation of MLS 
Stat3—expressing cells demonstrated that Stat3 
was present only the mitochondria (fig. S2B). 
Cells expressing MLS Stat3 had about the same 
amount of Stat3 in the mitochondria as did WT 
cells, but they had less total Stat3 than WT cells. 
MLS Stat3 cells (Table 2, row 3) showed similar 
activities of complexes I and II of the ETC as 
Stat3”* or Stat3~ cells that expressed Stat3a 
(Table 2, row 1). 

To determine whether tyrosine and serine 
phosphorylation and DNA binding activity of 
Stat3 are important for regulation of the ETC, 
MLS Stat3 that contained mutations in its DNA 
binding domain (//), Tyr’ (Table 2, rows 4 and 


Table 2. Activities of complexes | and Il of the 
respiratory chain in mouse Stat3” pro-B cells 
reconstituted with Stat3 mutants. Complex | and II 
activity in Stat pro-B cells and different mutants 
of Stat 3 expressed in terms of percentage values, 
considering the activity of complexes | and II being 
100% in Stat3~ pro-B cells reconstituted with WT 
Stat3a. (Stat3~ cells reconstituted with Stat3a 
showed complex | activity of 111 + 9 and complex II 
activity of 74 + 29, as compared with that seen in 
WT pro-B cells.) Results are presented as mean + 
SD. Significant differences with P values of <0.05 
were observed between Stat3" +Stat3a and 
Stat3~ cells reconstituted with Stat3 mutants. 


Cell type Complex | Complex II 

1 Stat3~~ + 100 100 
Stat3a (N = 7) (N = 3) 

2 Stats + 44 + 26 46 + 26 
MSCV (N = 7)* (N = 3)* 

3 Stat3~ + 82 + 20 114 + 42 
MLS Stat3a (N = 6) (N = 3) 

4A Stat3~ + 83 +34 124 +75 
MLS Stat3a (N = 4) (N = 3) 
Y705F 

5 Stat3/~ + 28 + 19 39 +30 
MLS Stat3a (N = 4)* (N = 3)* 
Y705F/S727A 

6 Stats’ + 98 + 20 115 +8 
MLS Stat3a (N = 3) (N = 3) 
Y705F/S727D 

7 Stat37~ + 90 +6 126 +4 
MLS Stat3a (N = 3) (N = 3) 
E434A/E435A 

8 Stat3/~ + 61+4 16+5 
Stat3CA (N = 3)* (N = 3)* 

*P < 0.05 


7), or Ser’”” and Tyr’ (Table 2, row 5) was ex- 
pressed in Stat3“" cells. Mutation in either Tyr” 
or the DNA binding domain (Table 2, rows 4 
and 7) restored activity of complexes I and II 
(Table 2). Because MLS Stat3 E434A/E435A 
(72) cannot form a dimer, it is possible that Stat3 
exerts its actions in the mitochondria as a mono- 
mer (1/). Consistent with this observation, Stat3~ 
cells expressing a constitutively active Stat3 (Stat3a 
CA) that forms a dimer without being tyrosine 
phosphorylated (73) did not restore mitochondri- 
al respiration (Table 2, row 8). 

Expression of MLS Stat3 with serine 727 
mutated to an alanine and tyrosine 705 mutated 
to a phenylalanine (row 5) also did not restore 
activity of complex I or II. To further character- 
ize the role of Ser’?’, we mutated Tyr” to a 
phenylalanine and Ser’”’ to either an alanine 
(Y705F/S727A) (row 5) or an aspartic acid 
(Y705F/S727D) (row 6). The former functions 
as a dominant negative and the latter as a mimetic 
of a constitutively serine-phosphorylated Stat3. 
MLS Stat3 Y705F/S727D reconstituted complex 
I and II activities (Table 2, row 6), whereas MLS 
Y705F/S727A (Table 2, row 5) was ineffective, 
another indication that the actions of this protein 
in the mitochondria are distinct from its activity 
as a transcription factor. Interestingly, the relative 
concentration of serine-phosphorylated Stat3 was 
highly enriched in mouse mitochondria as com- 
pared with that present in the cytoplasm (Fig. 1D). 


To confirm that MLS Stat3 and variants did 
not have transcriptional activity, we compared 
the expression of SOCS3, a Stat3-dependent gene, 
in cells incubated with or without IFNB (/4). 
Real-time polymerase chain reaction demon- 
strated that incubation of WT pro-B cells with 
IFNB induced an eightfold increase in SOCS3 
mRNA (fig. S3). However, Stat3“ cells or those 
cells expressing MLS Stat3 or MLS Stat3 with a 
mutation in its DNA binding domain showed no 
increase in SOCS3 expression (fig. $3). Further- 
more, cells that expressed constitutively active 
Stat3 in which respiration was not restored 
displayed robust IFN-stimulated expression of 
SOCS3 RNA. We obtained similar results when 
we examined IFNB-—induced expression of aden- 
osine deaminase | RNA in pro-B cells (fig. S3). 
These findings are another independent support 
of the concept that MLS Stat3 controls the ac- 
tivity of the electron transport chain, and this 
effect of Stat3 is unrelated to its actions as a 
transcription factor. 

To examine the role of Stat3 in mitochondrial 
respiration in vivo, we examined respiration in 
Stat3~ mouse hearts. Mice that do not express 
Stat3 in cardiomyocytes develop cardiac in- 
flammation with fibrosis, have dilated cardio- 
myopathy, and die prematurely of congestive 
heart failure (/5). Female mice that do not ex- 
press Stat3 in cardiomyocytes also develop post- 
partum cardiomyopathy, which is also seen in 
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Fig. 4. Decreased rates of O2 consumption in Stat3” heart mitochondria. Mitochondria isolated 
from littermates of WT (left) and Stat3~~ (right) hearts were incubated in an oxygen sensor 
chamber, and O2 consumption (y axis) as a function of incubation time (x axis) was recorded. In the 
upper panels, the mitochondria were incubated with pyruvate/malate (complex | substrates), and 
they were incubated with succinate (complex II substrate) in the lower panels. Different concen- 
trations of ADP were added to the mitochondria to measure state 3 respiration (0.2 mM ADP) or state 3 
Vinax fates of respiration (2 mM ADP). State 4 rates of respiration were calculated when 0.2mM ADP was 


depleted from the reaction. 
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humans with reduced Stat3 expression in the 
myocardium (/6). 

Mitochondria from the hearts of Stat3"e“"" 
(Stat3~") and Stat3"°*"°* (wild type) mice were 
assayed for complex I and II activity. The mice 
used for these studies were 8 weeks old, at which 
time the hearts are normal as analyzed by histol- 
ogy and various physiological parameters (/5). 
We used pyruvate and malate as complex I sub- 
strates (upper panels) and succinate as a complex 
II substrate (lower panels) to measure O, con- 
sumption in WT and Stat3~ heart mitochondria 
(Fig. 4). Rates of O2 consumption were de- 
creased in Stat3 mitochondria with both sub- 
strates. Maximal rates of state 3 respiration (Vinax) 
stimulated by the addition of 2 mM ADP were 
blunted in Stat3“~ mitochondria. Pyruvate and 
malate Vyax in state 3 was decreased by ~30% 
and succinate state 3 Vina, by ~60% (table S1). 
ETC assays confirmed that Stat3 “heart mitochon- 
dria had defects in complexes I and II but had 
normal activity of complex III (fig. S4). 

The results presented here reveal a role of 
Stat3 as a modulator of mitochondrial respira- 
tion. The most likely mechanism by which Stat3 


exerts its actions is not as a transcription factor 
that regulates nuclear gene expression, but rather 
through its localization in the mitochondria. The 
effects of Stat3 potentially represent a general 
mechanism by which this protein can influence 
cell metabolism. These studies are consistent 
with the concept of multidirectional communica- 
tion between the mitochondria and the nucleus 
and changes in homeostasis detected at the plas- 
ma membrane. The precise mechanism by which 
Stat3 regulates complexes I and II remains to be 
determined. 
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Platelets Kill Intraerythrocytic 
Malarial Parasites and Mediate 
Survival to Infection 
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Platelets play a critical role in the pathogenesis of malarial infections by encouraging the 
sequestration of infected red blood cells within the cerebral vasculature. But platelets also have 
well-established roles in innate protection against microbial infections. We found that purified 
human platelets killed Plasmodium falciparum parasites cultured in red blood cells. Inhibition of 
platelet function by aspirin and other platelet inhibitors abrogated the lethal effect human 
platelets exert on P. falciparum parasites. Likewise, platelet-deficient and aspirin-treated mice were 
more susceptible to death during erythrocytic infection with Plasmodium chabaudi. Both mouse 
and human platelets bind malarial-infected red cells and kill the parasite within. These results 
indicate a protective function for platelets in the early stages of erythrocytic infection distinct from 


their role in cerebral malaria. 


daptive immunity to malarial infection 
accumulates slowly over the lifetime of 
an individual living in an endemic region. 


Cross-immunity between isolates is low, and 
every new infection requires the development of 
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a virtually novel immune response. Therefore, 
innate mechanisms that limit parasite growth 
within the red blood cell (RBC) are extremely 
important in determining survival, especially 
during the first few years of life, because clinical 
severity correlates closely with parasite mass. 
Several known mutations affect the RBC and 
decrease the severity of infection through either 
impediment of parasite entry or development 
within the cell (7). However, we know little about 
other non-RBC mechanisms that may influence 
the course of infection before the onset of effective 
adaptive immune responses. 

Thrombocytopenia is a common clinical ac- 
companiment of Plasmodium falciparum (2-4), 
Plasmodium vivax (5, 6), and Plasmodium chabaudi 


(7) malarial infections. Low platelet concentra- 
tions correlate with increased parasite density (8, 9) 
and poor outcome (/0). Platelets bind preferen- 
tially to infected RBCs (//, 1/2) and have been 
postulated to play a role in the pathogenesis of 
malarial infection, either positively (13, 14) or 
negatively (/5, 16). Given the role platelets play 
in other infectious diseases (/7), we explored their 
influence on the course of a malarial infection in 
the mouse model, P. chabaudi, and in vitro using 
the cultured human malarial parasite, P_ falciparum. 

The megakaryocyte growth and differentiation 
factor, C-mpl, is the receptor for thrombopoietin 
and is encoded by the Mp/ gene. Homozygous dis- 
ruption of the Mp/ gene on a C57BL/6 background 
(Mpl “) results in mice with only one-tenth as 
many circulating platelets as their undisrupted coun- 
terparts have (/8). Platelet-deficient Mp!’ mice 
were significantly more susceptible to death from 
P. chabaudi than were isogenic C57BL/6 mice 
(P < 0.0001, Mantel-Cox log-rank test on sur- 
vival data), irrespective of well-known sex dif- 
ferences. In females, 5% of C57BL/6 versus 50% 
of Mp!’ mice died. Twenty percent of the male 
C57BL/6 mice and all the male Mp! * mice died 
(Fig. 1, A and B). 

Cohort analysis of groups of mice showed 
that platelet numbers decreased concomitantly 
with the appearance of parasites in the peripheral 
circulation (Fig. 1, E and F). Platelet numbers 
dropped to 10 to 20% of normal values in the 
C57BL/6 mice, with a smaller percentage drop in 
the MpI* mice. Platelet numbers reached a nadir 
before the onset of severe disease symptoms and 
anemia. No evidence of excess bleeding was seen 
in the Mp!“ mice during autopsy. Similar eryth- 
ropoietic kinetics occurred between Mp!’ and 
C57BL/6 mice during both malarial challenge 
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and experimentally induced hemolysis (figs. S1 
and S2), excluding differential erythropoiesis as a 
cause for the different susceptibilities. Nor were 
differences in immunological responses, including 
production of lymphocytes and inflammatory cyto- 
kines, observed between the mouse strains during 
infection (fig. S3). 

Acetyl salicylic acid (aspirin) inhibits cyclo- 
oxygenases I and II, resulting in a decrease in throm- 
boxane A2 and decreased platelet activation as 
seen by lower platelet aggregation (/9). Mice were 
given 25 mg of aspirin per kilogram of body 
weight intraperitoneally daily from the day before 
P. chabaudi infection until day 10 of infection and 
monitored for survival. CS57BL/6 and Mpl~~ mice 
of both sexes treated with aspirin were signifi- 
cantly more susceptible than placebo-treated mice 
(Fig. 1, C and D) (Cox proportional hazard 
analysis, P = 0.0009). Thus, pharmacological 
inhibition of platelet function increases suscepti- 
bility to malarial infection in mice, although other 
indirect effects of aspirin cannot be excluded. 

Synchronized cultures of P. falciparum troph- 
ozoites were grown in human RBCs and incubated 
with increasing concentrations of purified human 
platelets. A concentration-dependent increase in 
the inhibition of parasite growth was observed 
(Fig. 2A). Growth inhibition (measured after 44 
hours of incubation) peaked at 65% with 25 to 
75 million platelets/ml. Most viable parasites ob- 
served at this time were mature trophozoites and 
shizonts. Reductions in mature shizonts at 5 hours 
and new rings at 21 to 29 hours (fig. S4) suggest 
that platelets were inhibiting developmental matu- 
ration of the parasites. 

Using the P._ falciparum model, we investigated 
the dependence of parasite growth inhibition on 
platelet function by first incubating platelets with 
specific platelet antagonists. Maintenance of ele- 
vated intracellular adenosine 3’,5’-monophosphate 
and guanosine 3’,5’-monophosphate concentra- 
tions are key mechanisms in suppressing platelet 
activation (20, 2/). The addition of the adenylyl 
cyclase activator prostaglandin El completely 
abrogated the platelet growth-inhibitory activity. 
Exposure of platelets to the guanylyl cyclase 
activator, NO (donated by sodium nitroprusside), 
also suppressed platelet-mediated effects on par- 
asite growth (Fig. 2B). To test for the requirement 
of adenosine 5’-diphosphate (ADP), an impor- 
tant physiological regulator of platelet activation 
[reviewed in (20)], we incubated the platelet— 
P. falciparum cultures with adenosine diphos- 
phatase (ADPase) (22) to remove extracellular 
ADP from the culture medium. Unimpeded par- 
asite growth in the presence of platelets and 
ADPase was observed. ADP-dependent plate- 
let activation is mediated via two metabotrophic 
puronergic receptors, P2Y; and P2Y,5. Antago- 
nism of P2Y,, but not P2Y >, abolished the para- 
site killing activity (Fig. 2B). Thus, P falciparum 
growth is inhibited by platelets in a concentration- 
dependent and activation-dependent manner that 
requires ADP and activation through the P2Y, 
receptor. 


As with P. chabaudi in the mouse, inhibition 
of cyclooxygenase by aspirin also prevented the 
inhibition of parasite growth by platelets (Fig. 2B). 


To approximate a clinical effect, a volunteer platelet- 
donor took aspirin orally twice daily (4.2 mg/kg) 
for a week before donating blood for platelet 
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Fig. 1. Survival data and parasitological and hematological kinetics of P. chabaudi infection in C57BL/6 
and Mpl’ mice. (A and B) Survival curves for Mp’ (red) and C57BL/6 (black) mice infected with 
P. chabaudi and monitored for survival. (C and D) Effect of aspirin or vehicle-only treatment on survival of 
Mpl~ and C57BL/6 mice. Given that gender contributes a significant effect, the females (A and C) and 
males (B and D) have been separated. (E and F) Parasite and hematological parameters in C57BL/6 (E) 
and Mpl~“(F) mice. Parasitemias (percentage of infected RBCs), RBC counts (x10"/liter), and platelet 
counts (x107/liter). 
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Fig. 2. Platelet-mediated growth inhibition of P. falciparum cultured in human RBCs. (A) Parasite growth 
inhibition (relative to Tyrode’s buffer—treated cultures) with increasing concentrations of purified 
platelets. Chloro, parasites incubated with 640 nM chloroquine. (B) Parasite growth inhibition with 
platelets (50 million/ml) preincubated in platelet activation inhibitors at the indicated concentrations. 
Control, untreated platelets; PGE1, prostaglandin E1; SNP, sodium nitroprusside; MRS2179, P2Y, 
receptor antagonist; MRS2395, P2Y,2 receptor antagonist. (C) Inhibition of parasite growth in the 
presence of platelets (50 million/ml) preincubated with aspirin before addition to culture (in vitro aspirin) 
or untreated (control) compared with platelets taken from the same donor on separate occasions after 
treatment with aspirin (4.2 mg/kg taken orally twice daily for 1 week before platelet collection) (in vivo 
aspirin). All experiments were performed with washed platelets. Drugs used alone did not affect parasite 
growth. Bars for graphs represent the means of at least three independent experiments (A and B) and two 
independent experiments (C). Error bars indicate +1 SD. 
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purification. These platelets were unable to in- 
hibit growth of P. falciparum parasites (Fig. 2C), 
implying that aspirin is potentially harmful in ma- 


In mice infected with P. chabaudi for 8 days, 
with parasitemias at ~10 to 15%, platelets were 
preferentially bound to infected RBCs. As shown 
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Fig. 3. Infected RBCs are preferentially bound by platelets. (A) Giemsa-stained thin blood smears 
showing individual RBC-bound platelets (black arrows) [(i) and (ii), mouse; (iii) and (iv), human]. Some 
parasites look dysmorphic (red arrows) compared with normal parasites (blue arrows) (B) The percentage 
parasitemia (counted on Giemsa-stained thin smears) of all mouse RBCs compared to only those with 
platelets bound (P < 0.01). (C) The percentage CD41 staining of thiazole orange (TO) positive (infected) 
versus TO-negative (uninfected) mouse RBCs (P < 0.05) (D) Percentage of P. falciparum—infected and 
uninfected human RBCs bound by platelets counted on Giemsa-stained thin smears (P < 0.05). (E) 
Percentage of infected (TO-positive) and uninfected (TO-negative) human RBCs that stained positively for 
CD41 (P < 0.01). (B) and (C) refer to the in vivo mouse model (day 8 of infection) and (D) and (E) to the in 
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Fig. 4. Evidence that intraerythrocytic parasites are killed by platelets. (A) TUNEL staining of a 
P. chabaudi—infected C57BL/6 thin blood smear (day 7 of infection). (i) Hoechst staining of parasite DNA. 
(ii) TUNEL staining of parasite DNA, and (iii) both TUNEL and Hoechst staining superimposed on dark-field 
microscopy showing intraerythrocytic parasites. (B) Immunofluorescent images of P. falciparum- 
parasitized human RBCs cocultured with platelets for 24 hours. Separate images of cells stained with (i) 
Hoechst (all parasites), (ii) anti-CD41 (platelets), and (iii) TUNEL (dead parasites); (iv) merged image of (i), 
(ii), and (iii) (anti-CD41, red; TUNEL, green; Hoechst, blue). (C) The percentage of all P. chabaudi parasites 
staining positive with TUNEL in C57BL/6 and Mpl”~ mice (P < 0.05). (D) Percentage of TUNEL-positive 
parasites in P. falciparum cultures incubated with platelets (50 million/ml) and without platelets for 
24 hours (P <0.05). (E) Percentage of P. falciparum—parasitized RBCs bound by platelets that stained 
positively or negatively with TUNEL stain (P < 0.05). Bars for graphs represent the mean of at least three 
independent experiments with error bars representing +1 SD. 
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copy (Figs. 3A, i and ii), the percentage parasit- 
emia was twice as high when only platelet-bound 
RBCs were counted compared with all RBCs 
(Fig. 3B). CD41 (a platelet surface antigen) on 
the surface of RBCs served as a marker for plate- 
let binding. Three times as many infected RBCs 
stained for CD41 as did uninfected RBCs (Fig. 
3C). Similar results were obtained from cultured 
P. falciparum, where twice the numbers of infected 
RBCs were bound by human platelets (counted 
on Giemsa-stained thin smears) (Fig. 3A, ii and 
iv, and 3D) and twice the number of infected 
RBCs were also CD41 positive (Fig. 3E). There- 
fore, platelets selectively bind infected RBCs. 

The presence of dead parasites in the pe- 
ripheral circulation was investigated by adapting 
the terminal deoxynucleotidyl transferase deoxy- 
uridine triphosphate nick end labeling (TUNEL) 
technique to thin blood smears. TUNEL stains 
nuclei containing sheared or degraded DNA, an 
accompaniment of apoptosis or necrosis. As a 
positive control, all extraerythrocytic parasites, 
which commonly occur late in P. chabaudi in- 
fections, stained positive for TUNEL (fig. S5). In 
mice with lower parasitemias (day 7), TUNEL 
staining of intraerythrocytic parasites was evident 
(Fig. 4A), indicative of intraerythrocytic parasite 
death. 

Twice as many dead intraerythrocytic para- 
sites were seen in C57BL/6 mice as in Mpl 
mice (Fig. 4C). Although no differences in total 
parasitemia between Mpl ‘ and C57BL/6 were 
detected at this time point (most TUNEL-stained 
parasites are morphologically indistinguishable 
from live parasites), parasitemias measured at later 
time points were significantly higher in Mpl 
than in CS7BL/6 mice (fig. S6). This suggests that 
platelet-mediated parasite killing influences para- 
site numbers past the platelet nadir, resulting in 
higher parasitemias in Mp!’ mice, and explains 
their increased susceptibility. 

TUNEL-stained intracellular parasites were 
also observed in the P. falciparum cultures (Fig. 
4B). The frequency of stained parasites was more 
than three times as high in platelet-treated cultures 
(Fig. 4D). Significantly higher proportions of 
TUNEL-stained than nonstained infected RBCs 
were bound by human platelets (by staining with 
antibodies to CD41) (Fig. 4E). The greater num- 
bers of dead intraerythrocytic parasites in the 
presence of normal numbers of platelet in vivo 
and in vitro implies that platelets directly kill 
malarial parasites. 

Increased mortality in platelet-deficient mice 
and direct killing of P falciparum by human 
platelets indicate that platelets are important in 
controlling malarial infection. Thrombocytopenia 
occurs early in infection in both humans and mice, 
before severe forms of the disease develop (3, 6). 
Therefore, the major protective effect of platelets 
will be early in infection. Modulation of parasite 
growth by platelets will buffer the parasite growth 
rate and allow the engagement of the adaptive 
immune response. This finding is in addition to 
the well-established role for platelets in cerebral 
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disease (15, 23, 24). We have also shown that 
inhibition of platelet activation abrogates the 
protective effect, which could explain the delete- 
rious effect aspirin may have on malarial out- 
come (25). 
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Changes in Cortical Dopamine D1 
Receptor Binding Associated with 


Cognitive Training 


Fiona McNab,’ Andrea Varrone,” Lars Farde,”? Aurelija Jucaite,“? Paulina Bystritsky,* 


Hans Forssberg,* Torkel Klingberg** 


Working memory is a key function for human cognition, dependent on adequate dopamine 
neurotransmission. Here we show that the training of working memory, which improves working 
memory capacity, is associated with changes in the density of cortical dopamine D1 receptors. 
Fourteen hours of training over 5 weeks was associated with changes in both prefrontal and 
parietal D1 binding potential. This plasticity of the dopamine D1 receptor system demonstrates a 
reciprocal interplay between mental activity and brain biochemistry in vivo. 


orking memory (WM) is the ability to 
retain information for short periods of 
time and is important for a wide range 
of cognitive functions (/, 2). Reduced WM ca- 
pacity is associated with neurological and psy- 
chiatric disorders (3, 4) as well as normal aging 
(5). Several of these conditions are also associ- 
ated with impaired dopamine transmission (6, 7). 


Intensive training on WM tasks can improve 
WM capacity (S—/2) and reduce cognitively re- 
lated clinical symptoms (/0). Training-related 
improvements in WM have been associated 
with an increase in brain activity in parietal and 
frontal regions linked to WM (9), but the bio- 
chemical underpinnings of cognitive training are 
unknown. 


Dopaminergic neurotransmission has a cen- 
tral role in WM performance (/3—/6), and cor- 
tical dopamine release has been observed in 
humans during the performance of WM tasks 
(/7). In nonhuman primates, locally applied D1 
agonists, as well as antagonists, affect both per- 
formance and the neuronal firing patterns of 
prefrontal neurons when information is kept in 
WM (J8, 1/9). The effects seem to be dose- 
dependent (/5, 76), with evidence of an optimal 
level, so that either too much or too little stimu- 
lation of D1 receptors results in reduced WM per- 
formance or tuning of prefrontal activity (18—2/). 

The availability of dopamine can lead to the 
translocation of dopamine D1 receptors from the 
cytosol to the plasma membrane (22), and down- 
regulation of striatal dopamine D2 receptors has 
been shown to occur after 7 days of motor 
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Fig. 1. Maps of baseline D1 and D2 BP, averaged across 13 human 
volunteers. (A) The averaged MRI, normalized to MNI space. (B) D1 BP, 
measured with [**C]SCH23390, averaged across participants (the bar 


shows absolute D1 BP). (C) Overlay of (B) on (A). (D) D2 BP, measured 
with [**C]Raclopride, averaged across participants (the bar shows 
absolute D2 BP). (E) Overlay of (D) on (A). 
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training in developing rats (23). However, the 
Right postenor Bapeonseert ae : regulation of dopamine receptors as a result of 

26 ‘ : : cognitive training has not been studied. We thus 
investigated the possibility that up- or down- 
regulation of cortical D1 receptors and subcortical 
D2 receptors is associated with intensive mental 
activity during cognitive training. 

We used a previously described method of 
WM training in which participants perform WM 
tasks with a difficulty level close to their individ- 
ual capacity limit for about 35 min per day over a 
period of 5 weeks (8/0). Thirteen volunteers 
(healthy males 20 to 28 years old) performed the 
5-week WM training. Five computer-based WM 
tests (three visuospatial and two verbal) were 
used to measure each participant’s WM capacity 
before and after training, and they showed a 
significant improvement of overall WM capacity 
(paired ¢ test, t= 11.1, P < 0.001). The binding 
potential (BP) of D1 and D2 receptors was 
measured with positron emission tomography 
(PET) while the participants were resting, be- 
fore and after training, using the radioligands 
[''C]SCH23390 and ['!C]Raclopride, respectively 
(Fig. 1). 

To identify brain regions implicated in WM, 
we conducted functional magnetic resonance 
imaging (fMRI) on each individual. By compar- 
ing activity during a WM task to that during a 
control task, we identified regions specifically 
linked to WM (P < 0.05, false discovery rate 
corrected). This resulted in five regions of interest 
(ROIs) (Fig. 2, A to E), which were used to 
constrain the analysis of the D1 BP as follows: 
(i) A right posterior ROL, which included regions 
of the right parietal, temporal, and occipital corti- 
ces; (ii) a left posterior ROI, which included re- 
gions of the left parietal, temporal, and occipital 
cortices; (iil) a right dorsolateral prefrontal ROI, 
which included the right middle frontal gyrus and 
right superior frontal gyrus; (iv) a left frontal 
ROI, which included the left middle frontal 
gyrus; and (v) a right ventrolateral prefrontal 
ROI, which included the right inferior frontal 
gyrus. For calculation of D2 BP, bilateral caudate 
and putamen ROIs were defined anatomically. 
Although WM activity in the basal ganglia was 
not identified from the fMRI data in the present 
study, these regions have previously been asso- 
ciated with WM (//, 24) and are known to have 
a high density of D1 and D2 receptors (Fig. 1D). 
Based on suggestions of an inverted u-shape re- 
lationship between DA levels and performance, 
we analyzed the outcome using both linear 
(WM = 0. + £8, BP) and quadratic (WM = a + 
6, BP + BBP?) regression models (where a is 
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Fig. 2. (A to E) The five posterior (red) and frontal (green) cortical ROIs, identified from the fMRI results : é 
(the contrast of activity recorded during the WM task minus activity recorded during a control task) and the intercept and B; and B» are the regression 
used to constrain the analysis of D1 BP. The blue lines indicate where the axial and coronal planes ©0Cfficients). 

intersect. (F to J) The application of the quadratic model for the analysis of change in WM capacity for First We averaged baseline D1 BP ares the 
each of the ROIs. The x axis shows D1 BP before training, the y axis the D1 BP after training, and the z axis five cortical ROIs and averaged baseline D2 BP 
the improvement in WM. The colored surface represents predicted values, with warmer colors representing in the four subcortical ROIs, then analyzed the 
higher values on the z axis. The black circles represent the observed data. This model predicted change _‘relationship with overall WM capacity before 
in WM capacity in all except the left frontal ROI [right posterior: F(2,10) = 4.87, P = 0.033; left posterior: training. There was no significant association for 
F(2,10) = 4.82, P = 0.034; right dorsal frontal: F(2,10) = 7.19, P = 0.012; left frontal: F(2,10) = 0.71, either D1 or D2 BP (D1: linear r= 0.09, P = 
P = 0.515; right ventral frontal: F(2,10) = 5.38, P = 0.026]. 0.33; quadratic 7° = 0.34, P= 0.12 for the whole 
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model and P= 0.08 for the second-order term, B>, 
suggesting a trend for an inverted-u shape; D2: 
linear 7 = 0.11, P= 0.26; quadratic 7° = 0.12, P= 
0.53). Next we investigated the effect of training. 
For each participant, D1 BP change was aver- 
aged across the five ROIs. The change in WM 
capacity could be explained by both the linear 
model (negative correlation, P = 0.016) and the 
quadratic model (P = 0.001). However, the qua- 
dratic model {WM> — WM, = [a + B, BP» + 
B(BP3)"]—[a +B, BP; +B2(BP,) ], where WM, 
and WM; represent WM capacity before and after 
training, respectively; and BP; and BP? represent 
BP before and after training, respectively} pre- 
dicted a larger amount of variance (r° = 0.75) as 
compared to the linear model (r? = 0.42; r? of 
change between models = 0.33, P = 0.005). The 
quadratic model was then fitted for each region 
individually and described the data at a statisti- 
cally significant level (P < 0.05) for the right ven- 
trolateral frontal, right dorsolateral frontal, and 
both posterior ROIs (Fig. 2, F to J). For D2 BP, 
the average change across all ROIs was not re- 
lated to the change in WM capacity (linear model: 
r= 0.02, P = 0.67; quadratic model: r= 0.08, 
P= 0.66). 

These findings show that training-related 
changes in WM capacity are associated with changes 
in D1 BP. The binding of [''C]SCH23390 has 
been shown to be insensitive to the immediate 
effect of drugs changing endogenous dopamine 
concentration and may thus serve as an index 
for the density of available D1 receptors (25). 
Although the relation between performance and 
dopamine BP is probably nonlinear, our data 
(Fig. 2, F to J) generally showed that, within the 
measured range, a negative correlation domi- 
nated for all regions, with larger decreases in D1 
BP being associated with larger improvements in 
WM. This is consistent with the finding that low 
doses of a D1 antagonist enhance the delay activ- 
ity of prefrontal neurons during the performance 


of WM tasks (/8, 79). An association between a 
decrease in BP and an increase in WM is also 
consistent with the negative correlation observed 
between WM capacity and D1 binding in indi- 
viduals with schizophrenia (26). 

The underlying mechanisms responsible for 
the plasticity of receptor densities are not known. 
One possibility is that other transmitters influence 
the trafficking of dopamine receptors; for exam- 
ple, it has been shown that the activation of N- 
methyl-p-aspartate receptors affects dopamine 
signaling by recruiting D1 receptors from the in- 
terior of the cell to the plasma membrane (27). 
Another interpretation is that the changes reflect 
long-term adjustment of the concentration of D1 
receptors in response to a prolonged increase in 
the level of endogenous dopamine during WM 
training. 

More generally, the present results demon- 
strate a high level of plasticity of the neuronal 
system defined by cortical D1 receptors in human 
volunteers. The findings were specific because 
the D2 system did not show any relation to WM 
changes. The training-induced changes empha- 
size the reciprocal interplay between behavior 
and the underlying brain biochemistry and 
should be relevant for studies of neuropsychiatric 
disorders as well as correlational studies between 
behavior and biochemical markers. 
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Axon Regeneration Requires a 
Conserved MAP Kinase Pathway 


Marc Hammarlund,”2*+ Paola Nix,?+ Linda Hauth,* Erik M. Jorgensen,”’? Michael Bastiani?+ 


Regeneration of injured neurons can restore function, but most neurons regenerate poorly or 
not at all. The failure to regenerate in some cases is due to a lack of activation of cell-intrinsic 
regeneration pathways. These pathways might be targeted for the development of therapies that 
can restore neuron function after injury or disease. Here, we show that the DLK-1 mitogen- 
activated protein (MAP) kinase pathway is essential for regeneration in Caenorhabditis elegans 
motor neurons. Loss of this pathway eliminates regeneration, whereas activating it improves 
regeneration. Further, these proteins also regulate the later step of growth cone migration. 

We conclude that after axon injury, activation of this MAP kinase cascade is required to 

switch the mature neuron from an aplastic state to a state capable of growth. 


evered neurons can regenerate. After axons 
are cut, neurons can extend a new growth 
cone from the axon stump and can attempt 
to regrow a normal process. Most invertebrate 


neurons are able to regenerate, as are neurons in 
the mammalian peripheral nervous system. By 
contrast, neurons in the mammalian central ner- 
vous system have limited regenerative capability 


(/). Regeneration is thought to be initiated by 
signals arising from the injury, including calcium 
spikes and the retrograde transport and nuclear 
import of regeneration factors (2). These mech- 
anisms lead to increased cyclic adenosine mono- 
phosphate (cAMP) levels, local and somatic protein 
synthesis, and changes in gene transcription that, 
in turn, promote remodeling of the cytoskeleton 
and plasma membrane at the site of injury. The 
ability of specific neurons to regenerate is deter- 
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mined in part by the balance between proregen- 
eration signals and cellular pathways that inhibit 
regeneration. For example, regeneration in the 
mammalian CNS is inhibited by extrinsic signals 
from myelin and chondroitin sulfate proteogly- 
cans; these signals activate pathways in the dam- 
aged neuron that prevent regrowth (3). But CNS 
regeneration can be achieved even in the pres- 
ence of inhibitory signals. A conditioning le- 
sion to a peripheral process results in increased 
regeneration of the CNS branch of dorsal root 
ganglion neurons, presumably by triggering in- 
jury signals that result in an overall increase in 
regenerative potential (4). Thus, intrinsic re- 
generation signals can influence regenerative 
success, and these signaling processes repre- 
sent potential targets for therapies to enhance 
regeneration. 

We identified the mitogen-activated protein 
(MAP) kinase kinase kinase (MAPKKK) d/k-/ 
as essential for regeneration in the course of a 
large screen for genes required for regeneration. 
This screen was conducted in a B-spectrin mutant 
(unc-70) background (5). Neurons in B-spectrin 
mutant nematodes break because of mechanical 
strain induced by locomotion. The y-aminobutyric 
acid (GABA)-releasing motor neurons respond 
to breaks by regenerating toward their targets 
in the dorsal cord (6). Axon guidance during 
regeneration is imperfect, resulting in axons in 
mature animals with branching and other abnor- 
malities (Fig. 1). We found that RNA interference 
of dik-/ eliminates regeneration in unc-70 mutant 
animals. Neither unc-70 (6) nor dik-I (7) is es- 
sential for axon outgrowth during development 
of the GABA neurons (Fig. 1) (8). The unc-70 
dlk-1 synthetic phenotype for axon morphology 
suggests that d/k-] may function specifically in 
regeneration. 

To demonstrate that d/k-/ functions in regen- 
eration independently of unc-70, we used laser 
axotomy to trigger regeneration. The GABA motor 
neurons can regenerate after laser axotomy (9). 
We used a pulsed 440-nm laser to cut axons (/0). 
In larval stage 4 (L4) wild-type animals, 70% 
of severed axons initiated growth cones within 
24 hours after axotomy (Fig. 2 and table S1). But 
when axons were cut in L4 stage dlk-1(ju476) 
null mutants, growth cones were never observed. 
These severed neurons appeared healthy after sur- 
gery; both the stump of the remaining axon and 
its cell body showed no decrease in green fluo- 
rescent protein (GFP) expression or other signs of 
injury. Nevertheless, these neurons failed to re- 
generate. To test whether regeneration was mere- 
ly delayed, we monitored some severed axons for 
5 days; regeneration still was not observed. Thus, 
dlk-1 is essential for axon regeneration after spon- 
taneous breaks and after laser surgery but is dis- 
pensable for axon outgrowth during development. 

Mosaic experiments demonstrate that the 
DLK-1 protein acts in the damaged cell rather 
than in the surrounding tissue. To determine 
whether DLK-1 acts cell-autonomously to pro- 
mote regeneration, we expressed d/k-/ under the 


GABA-specific promoter Punc-47 in the dlk-1 
null background. Neurons were severed by laser 
surgery, and regeneration was assayed after 18 to 
24 hours. We found that expressing DLK-1 in the 
GABA neurons restored regeneration to d/k-1 
null mutants (Fig. 2C). Further, mutations that af- 
fect DLK-1 levels also affect regeneration (Fig. 2D). 
In Caenorhabditis elegans, DLK-1 levels are neg- 
atively regulated by RPM-1 (7). Overexpression 
of RPM-1 reduced regeneration after surgery to 
levels similar to those of d/k-/ loss-of-function 
mutants. Conversely, initiation of regeneration was 
enhanced in rpm-/ mutant animals. Initiation of 
regeneration was also enhanced in animals lack- 
ing FSN-1, an F-box protein that functions with 
RPM-1 to promote DLK-1 degradation (/7). How- 
ever, loss of GLO-1 (Rab) or GLO-4 (Rab guanine 
nucleotide exchange factor), both of which me- 
diate ubiquitin-independent functions of RPM-1 
(11), did not have strong effects on regeneration. 
These results confirm that changes in DLK-1 
protein abundance can determine regenerative 
ability. 

Although regeneration is age-dependent, d/k-1 
is required at all stages, and overexpression of 
DLK-1 can rescue some age-dependent decline 
(Fig. 2E). We analyzed regeneration at various 
developmental stages. We found that regeneration 
declines significantly with age, and only a few 
axons in old adults regenerated (/0). Despite 
these differences, d/k-1 is required for all regener- 


Fig. 1. dlk-1 is required for axon 
regeneration in B-spectrin mutant 
nematodes. (A) Cartoon showing de- 
velopment of axon morphology in 
control B-spectrin mutant (left) and 
in B-spectrin mutant lacking hypo- 
thetical regeneration gene, e.g., 
dlk-1 (right). (B and C) GABA neu- 
rons in representative L4 stage B- 
spectrin mutants (unc-70), expressing 
green fluorescent protein, under con- 
trol conditions or after dlk-1 RNA 
interference. Scale bars, 20 um. (D) 
High-magnification view of boxed 
region in (C). Arrows indicate inert 
axon stumps. Scale bar, 10 um. (E 
and F) GABA neurons in represen- 
tative L4 stage wild-type and dlk-1 
mutant animals. Scale bars, 20 um. 


ation, even in very young animals. Because growth 
cones in the very young animals had not yet 
reached their targets, the dependence on d/k-] 
is not correlated with target contact or synapto- 
genesis. Further, d/k-/ is required for regen- 
eration of both presynaptic and postsynaptic 
processes (8). 

To mediate regeneration, DLK-1 is required 
at the time of injury (Fig. 2F). We expressed the 
DLK-1 protein at different times using the heat 
shock promoter Phsp-16.2. DLK-1 expression at 
the time of injury was sufficient for regeneration. 
Applying heat shock hours before or hours after 
surgery resulted in less regeneration, and when 
heat shock was applied either 11 hours before or 
48 hours after surgery, little or no regeneration 
was observed. This effect was independent of age, 
because surgery in L2 stage larvae failed to elicit 
regeneration when heat shock was applied 48 hours 
later. Thus, DLK-1 must function within a short 
temporal window near the time of injury to me- 
diate regeneration, rather than establishing a per- 
missive state for regeneration during development. 
These data suggest that DLK-1 signaling must co- 
incide with other proregeneration signals, such as 
calpain activation (/2) or cAMP elevation (/3—/5), 
for regeneration to occur. 

DLK-1 is required for growth cone formation 
rather than the earlier step of filopodial extension. 
We used time-lapse microscopy to monitor mor- 
phological changes in axons after surgery. We 
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found that in wild-type animals, newly severed 
axons repeatedly extend short, transient filopodia 
from the axon stump (Fig. 3 and movie S1). The 
first filopodium appears with an average delay of 
more than 3 hours. In animals that successfully 
initiate regeneration, a single filopodium eventu- 
ally persists and is transformed into a growth cone. 
Growth cone formation in wild-type animals oc- 
curs with an average delay of 7 hours after sur- 
gery. In d/k-] mutants, transient filopodia appear 
at approximately the same time and the same rate 
as in the wild type. However, growth cones were 
never observed in these mutants. These data dem- 
onstrate that DLK-1 is required to transform ex- 
ploratory filopodia into growth cones. 

Increased expression of DLK-1 in wild-type 
animals accelerates the formation of growth cones 
and improves migration success. We overexpressed 
DLK-1 in GABA neurons and found that time of 
growth cone initiation by axon stumps was ad- 
vanced relative to the wild type (Fig. 3 and movie 
S1). Also, more axons initiated growth cones 
(Fig. 4). In addition to these effects on growth 
cone initiation, DLK-1 overexpression improved 
growth cone performance. In wild-type animals, 
regenerating growth cones often have a branched, 
dystrophic morphology. Dystrophic growth cones 
migrate poorly, and most never reach the dorsal 
nerve cord in 24 hours (Fig. 3). These dystrophic 
growth cones resemble dystrophic growth cones 
observed in failed regeneration in the mammalian 
CNS (/6). By contrast, regenerating growth cones 
in neurons that overexpress DLK-1 have a com- 
pact shape, similar to growth cones observed dur- 


Fig. 2. dlk-1 is required in severed axons for 
growth cone initiation. (A) Regenerating axons 
18 to 20 hours after laser surgery in a wild- 
type animal. Both severed axons have gener- 
ated a growth cone (arrows). Scale bar, 10 um. 
(B) Axons in dlk-1 mutants fail to generate 
growth cones 18 to 24 hours after surgery. 
Scale bar, 10 um. (C) DLK-1 acts cell intrin- 
sically to mediate regeneration. (D) RPM-1 
controls DLK-1 activity in axon regeneration. 
(E) Regeneration requires d/k-1 at all ages and 
overexpression of DLK-1 rescues age-associated 
decline. DD and VD are motor neurons of dif- 
ferent lineages (8). (F) DLK-1 acts at the time 
of injury to mediate regeneration. Time of heat 
shock relative to surgery is indicated in hours. 
No heat shock is indicated by "no." "L2" in- 
dicates surgery at the L2 stage. Axotomy at L4 
stage unless indicated otherwise. (C) to (F) 
Percentage of axons that initiated regener- 
ation and 95% confidence interval (Cl). *P < 
0.05, **P < 0.01, ***P < 0.001. 


ing initial axon development (/7). These compact 
growth cones were much more likely to reach the 
dorsal nerve cord. Growth cone migration during 
regeneration in C. elegans shares some genetic 
requirements with developmental axon guidance, 
including components of the netrin and slit sig- 
naling pathways (/8) and the ephrin pathway (/0). 
It is possible that DLK-1 overexpression improves 
regeneration by affecting the response of the 
growth cone to such signals. Thus, DLK-1 acts at 
two steps of regeneration: It is required for growth 
cone formation, and it also controls growth cone 
morphology and behavior. 

DLK-1 functions in a MAP kinase signaling 
cascade that also includes the MAP kinase kinase 
(MAPKK) MKK-4, and the p38 MAP kinase 
PMK-3 (7). We tested whether this entire MAP 
kinase signaling module functions in regeneration 
by examining null mutants in mkk-4 and pmk-3. 
Like dlk-/, neither of these mutants has apprecia- 
ble defects in axon outgrowth during develop- 
ment. But after axotomy, both mutant strains fail 
to initiate regeneration (Fig. 4A). These data sug- 
gest that MKK-4 and PMK-3 are the downstream 
targets of DLK-1 for regeneration. Inhibition of 
p38 also reduces regeneration of cultured verte- 
brate neurons (/9), which suggests that the func- 
tion of p38 MAP kinases in regeneration is 
conserved. Do other MAP kinase cascades also 
contribute to regeneration? We tested a sampling 
of C. elegans MAP kinase components and found 
that mutations in these genes did not eliminate 
regeneration (Fig. 4B and table S1). Initiation of 
regeneration was not affected by loss of the 
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MAPKKK usy-/ or its target MAPKK sek-/. 
Loss of the MAPKK jkk- also did not affect 
regeneration. By contrast, loss of the MAPKKK 
milk-1 reduced initiation of regeneration (although 
some regeneration still occurred), as did loss of 
its downstream target mek-/. MLK-1 and MEK-1 
are thought to activate a second C. elegans 
p38 MAP kinase, PMK-1 (20), which suggests 
that multiple p38 family members contribute to 
regeneration. (Because null mutations in pmk-1 
are lethal, we were unable to test its function 
directly.) Loss of the MAP kinase jnk-1 in- 
creased initiation of regeneration. Thus, whereas 
the DLK-1/MKK-4/PMK-3 MAP kinase cas- 
cade is required to initiate regeneration, other 
MAP kinase pathways also regulate this pro- 
cess. Consistent with these data, mutations in 
mkk-4 or pmk-3 did not eliminate the stimula- 
tion of regeneration by DLK-1 overexpression, 
which suggests that cross-talk between MAP 
kinase modules may contribute to regeneration 
(Fig. 4C). However, the modest phenotype of 
other MAP kinase mutants and the inability of 
DLK-1 overexpression to bypass the requirement 
for mkk-4 and pmk-3 suggest that the DLK-1/ 
MKK-4/PMK-3 module is the major MAP ki- 
nase pathway for axon regeneration. 

What stimulates DLK-1 function when an 
axon breaks? In the simplest model, the axon 
break interrupts trafficking of DLK-1. Local 
DLK-1 accumulation in the injured neuron then 
leads to homodimerization and activation, fol- 
lowed by activation of the downstream targets 
MKK-4 and PMK-3 (Fig. 4D). Alternatively, 
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specific regulatory mechanisms activate DLK-1 
after injury, such as scaffolding proteins like Jip1 
(21); phosphatases such as PP1, PP2a, and cal- 
cineurin (22); or regulators of the proteasome (23). 
The strict requirement for the DLK-1 pathway in 
regeneration suggests that mature neurons have 
intrinsic barriers to growth that are not present 
during development. Once axons have reached 
their target and have begun synaptogenesis, ter- 
mination of growth signals by mechanisms like 
RPM-1 may down-regulate growth to allow syn- 
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apse maturation and to stabilize neuronal architec- 
ture. Indeed, mutations in RPM-1 or its homologs 
cause overgrowth of axons in worms (24), aber- 
rant sprouting in Drosophila (25), and aberrant 
growth cone initiation on axon shafts in mouse 
(26). We found that d/k-1 is required for regen- 
eration even in neurons that are actively growing 
at the time of injury. Further, overexpressing 
DLK-1 partially prevented the loss of regenera- 
tion in old animals (Fig. 2E). Thus, barriers to 
growth are quickly erected in axons, and post- 


injury signaling via DLK-1, MKK~4, and PMK-3 
is required to drive the neuron back to its pre- 
lapsarian, embryonic state. 

How does the MAP kinase PMK-3 stimulate 
regeneration? The DLK-1 pathway is first required 
for growth cone formation about 7 hours after a 
break occurs—a process likely to be mediated by 
the polymerization of microtubules. Activated p38 
MAP kinase regulates microtubule dynamics (26), 
and microtubule remodeling is required for growth 
cone initiation during regeneration (27). Further, 
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10 uum. (E) Representative growth cones in a wild-type animal. Although these axons suc- 
cessfully initiated regeneration, the growth cones (arrows) have a dystrophic morphology. 
Scale bar, 10 im. (F) Representative growth cones in an animal overexpressing DLK-1 under 
the unc-47 promoter. These growth cones (arrows) have a compact morphology similar to 
growth cones observed during development. Scale bar, 10 um. (G) Distribution of all times 
of filopodia initiation in wild type and dlk-1. Each dot represents a filopodium. (H) Time of 
first filopodium initiation in wild type and dlk-1. Means + SEM. (1) Rate of filopodia initiation 


in wild type and dlk-1. Means + SEM. (J) Time to initiate regeneration after surgery in wild 
type and dlk-1 overexpressing (OE) animals. Initiation is defined as the appearance of the filopodia that becomes a growth cone. Each dot represents a single axon. 
(K) Percentage of wild-type and dlk-1 overexpressing (OE) regenerating axons that reached the dorsal cord after 18 to 24 hours. Error bars indicate 95% Cl. 
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Fig. 4. MAP kinase signal- 
ing is required for axon re- 
generation. (A) Regeneration 
is eliminated by mutations 
in the DLK-1/MKK-4/PMK-3 
MAP kinase module. (B) Other 
MAP kinase elements con- 
_ tribute to regeneration, but 
are not essential. (C) Activated 
DLK-1 has targets in addi- 
tion to MKK-4 and PMK-3. 
(D) Model for function of 
MAP kinase signaling dur- 
ing axon regeneration. (A) 
to (C) Error bars indicate 
95% Cl. 
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defects in microtubule dynamics contribute to the 
axon outgrowth phenotype of Phr/ mutant mice 
(26). Activated p38 may also control other targets 
that facilitate axon regeneration. p38 regulates 
local protein synthesis (28), which is required for 
regeneration (/9). p38 is also likely to have func- 
tions in the nucleus, because it contributes to 
injury-induced changes in gene transcription (29). 
Activated p38 may reach the nucleus by retro- 
grade transport. Retrograde transport in general is 
critical for regeneration (30), and transport of ac- 
tivated MAP kinases from axons to the cell body 
following axotomy has been observed in Aplysia 
sensory neurons (3/) and in rodent sciatic nerves 
(32, 33). Thus, regeneration may require acti- 
vated PMK-3/p38 at the site of the break to reg- 
ulate microtubule stability and protein expression 
and also may require PMK-3 to traffic to the nu- 
cleus to regulate gene transcription (Fig. 4D). 
The DLK-1 signaling pathway thus provides a 
critical link between axon injury and the process 
of regeneration. 
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